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STRUCTURE AND NATURE 


E are familiar with three adequate but incompatible ways 
W of modeling number theory in set theory, or the theory 
of classes; and there are infinitely many more. We bandy 
our numbers without caring which classes we are bandying from 
among this wealth of alternatives. We are just content that we are 
operating somewhere within the ontology of classes to which we 
have committed ourselves anyway for other purposes. This situation, 
long since remarked upon by Paul Benacerraf' and others, receives 
formal expression in Ramsey sentences.” Such a sentence merely 
affirms that there are classes obeying the desired laws—arithmetical 
laws in this case—and then proceeds to apply such unspecified 
classes to whatever arithmetical business we had in mind. 

That is the structuralist treatment of number. It is Just a way of 
eliminating an idle question—‘What is number?’’—and a gratui- 
tous decision among indifferent alternatives. Ontological economy 
is not its point; the same old classes are presupposed. 

David Lewis? has ventured the next step: a structuralist treatment 
of classes themselves. Now just as structuralism in number theory 
depended on the various ways of reducing numbers to classes, so 
structuralism in class theory depends on reducing classes in turn 
to—what? For Lewis, as for most of us, I should like to think, there 
are only classes and concrete individuals. So his structuralism is in- 
deed ontologically significant. It professes a reduction of classes to 
individuals—hence out and out nominalism. 

Arithmetical structuralism, as expressed in Ramsey sentences, de- 
pended on there being classes in which the required arithmetical 
structures could be defined and realized, though we were freed of 
choosing among alternatives. Similarly for Lewis’s structuralism of 

| “Mathematical Truth,” this JOURNAL, LXX, 19 (November 8, 1973), 661-79 

* Frank Ramsey, The Foundations of Mathematics (New Yor Routledge, 


1981), pp. 212-36. . 
3 Parts of Classes (New York: Blackwell, 1991). 
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classes; it depends, for its truth, upon there being individuals in 
which the desired classial structures can be defined and realized. 
Thus he needs a theory of individuals. 

What he uses is mereology, or the theory of part and whole, with 
atoms. By ingenious constructions he carries the day, contingently 
on there being enough atoms, and granted George Boolos’s* view of 
second-order logic as just a classless logic of plurals. If there are not 
enough atoms, then the higher reaches of set theory turn out mean- 
ingful but false. How much of axiomatic set theory is true comes to 
depend on the size of the concrete universe: on how many atoms 
there are. If there are only finitely many, then so much the worse for 
classical set theory even at its weakest. Truth in mathematics comes 
to depend on the richness of nature. Lewis appreciates all this. 

Uncertain as to the richness of nature, we can still pursue higher 
set theory in a conjectural spirit, finding out how it would be if 
nature were up to it. In any event, Lewis’s construction renders the 
whole of set theory meaningful from his nominalistic point of view, 
however much of it is thereby rendered true and how much false. 
And, ironically enough, the scientific utility of applied mathematics 
is independent of all that. 

Structuralism for classes, hence for all abstract objects, is undeni- 
ably congenial. They are things that are known anyway only by their 
structural role in cognitive discourse; never by ostension. By osten- 
sion and extrapolation we do learn what individuals qualify as 
members of the class of cats, but that class itself is just an abstract 
entity to which each cat bears a cryptic epsilon relation. Your class 
of cats and mine can still be different things, and your membership 
relation and mine can be different, if it makes any sense to say so, 
though we see eye to eye on every cat. Such is the appeal of struc- 
turalism regarding abstract objects. 

Much though I admire Lewis’s reduction, however, it is not for 
me. My own line is a yet more sweeping structuralism, applying to 
concrete and abstract objects indiscriminately. I base it, paradoxical 
as this may seem, on a naturalistic approach to epistemology. Natu- 
ral science tells us that our ongoing cognitive access to the world 
around us is limited to meager channels. There is the triggering of 
our sensory receptors by the impact of molecules and light rays. 
Also there is the difference in muscular effort sensed in walking up 
or down hill. What more? Even the notion of a cat, let alone a class 
or number, is a human artifact, rooted in innate predisposition and 
cultural tradition. The very notion of an object at all, concrete or 
abstract, is a human contribution, a feature of our inherited appara- 
tus for organizing the amorphous welter of neural input. 

4 “Nominalist Platonism,” Philosophical Review, XCIV, 3 (July 1985): 327-44. 
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Language, the vehicle of science, is linked to our neural input by 
neural mechanisms of association or conditioning. Early in our 
learning of language we learn to associate certain expressions with 
certain ranges of neural input. I call these expressions observation 
Sentences. Grammatically, some are indeed sentences, e.g., ‘It’s rain- 
ing’, and some are nouns or adjectives, e.g., ‘Cat’ or ‘Milk’ or 
‘White’. To begin with, they are just expressions associated ho- 
lophrastically with ranges of neural input. They are as if to say, with 
William James, “Hello, thingumbob again” —except that even ‘thin- 
gumbob’ hints more of objectual reference than I could wish. The 
observation sentence names nothing, to begin with, neither neural 
input nor external object. But it is learned from adults who have 
learned the ways of reference, so it is indeed an expression, such as 
‘It’s raining’ or ‘Cat’ or ‘Milk’ or ‘White’, that is destined for even- 
tual integration into a system of objectual reference on the child’s 
own part as he matures. This growing system of terms and pur- 
ported objects is our science of nature, our organization of the 
welter of neural input. 

Reification and implication are the key principles by which that 
organizing proceeds. We master implication in the course of learn- 
ing to use the logical particles, such as ‘not’, ‘and’, ‘or’, ‘if’, ‘every’, 
‘some’. For instance, our learning to use ‘not’ and ‘or’ consists in 
part in learning to assent to ‘g’ whenever we have assented to ‘not p’ 
and ‘p or q’. Our learning of ‘every’ consists in part in learning to 
assent to ‘a is a G’ once we have assented to ‘a is an F and ‘Every F 
isaG. 

It is in implications of this last sort that reification makes its con- 
tribution, in our having reified the object a and the other Fs and Gs. 
The reification of bodies comes in stages in one’s acquisition of 
language, each successive stage being more clearly and emphatically 
an affirmation of existence. The last stage is where the body is recog- 
nized as identical over time, despite long absences and interim modi- 
fications. Such reification presupposes an elaborate schematism of 
space, time, and conjectural hidden careers or trajectories on the 
part of causally interacting bodies. Such identifications across time 
are a major factor in knitting implications across the growing fabric 
of scientific hypotheses. 

At more sophisticated stages in the development of language and 
science, implications are enhanced by positing further objects, no 
longer observable; thus subvisible particles, also numbers and other 
classes. g ; 

The implications thus forged are what relate our evolving scien- 
tific theory to our heterogeneous neural input and hence ultimately 
to the external world that our scientific theory purports to describe. 
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This fabric of interlocking sentences hooks on to the neural input at 

the observation sentences. Some of these, we saw, were learned by 

primitive conditioning to ranges of neural input. Many further ob- 

servation sentences, more sophisticated ones, are scientific sen- 

tences retroactively associated with observation through the inter- 

vention of theory; thus the chemist sees at a glance that there was 

copper in the solution, and perceives at a sniff that sulphur dioxide 

escaped. Now the way implication links theory to neural input is that , 
a bundle of scientific hypotheses is found to imply that the fulfill- 

ment of one of two given observation sentences should always en- 

sure fulfillment of the other. This dictates an experiment: he brings 

about fulfillment of the one and checks for the other. Failing the 

other, the bundle of hypotheses is discredited; one or another of 
them must be revoked. 

Such, in schematic caricature, is surely the evidential relation of 
scientific theory to the triggering of our neuroreceptors by the ex- 
ternal world. I have ventured more detail elsewhere,® and much 
more wants venturing. Perhaps a fair bit of science might be found 
on fuller analysis to be beyond the reach of evidence altogether. But 
this sketch will suffice for what I want to say about structuralism. 

The point I now want to make is one that over the years I have 
repeatedly made in terms of what I call proxy functions. The point 
is that if we transform the range of objects of our science in any 
one-to-one fashion, by reinterpreting our terms and predicates as 
applying to the new objects instead of the old ones, the entire evi- 
dential support of our science will remain undisturbed. The reason 
is twofold. First, implication hinges only on logical structure and is 
independent of what the objects, the values of the variables, may be. 
Second, the association of observation sentences with ranges of 
neural input is holophrastic. It is independent of reifications, inde- 
pendent of whatever objects the observation sentences or their parts 
may be taken to refer to as terms. 

The conclusion is that there can be no evidence for one ontology 
as over against another, so long anyway as we can express a one-to- 
one correlation between them. Save the structure and you save all. 
Certainly we are dependent on a familiar ontology of middle-sized 
bodies for the inception of reification, on the part both of the indi- 
vidual and of the race; but once we have an ontology, we can change 
it with impunity. For abstract objects this is unsurprising, and quite 
in the spirit of Ramsey, Lewis, and Benacerraf. For familiar bodies it 
is less intuitive. But we must bear in mind that an observation sen- 


$ The Roots of Reference (Peru, IL: Open Court, 1974); and Pursuit of Truth 
(Cambridge: Harvard, 1990). 
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tence, say, ‘That’s a rabbit’, retains all its original visual associations; 
it is only the purported object that is gratuitously displaced. 

This global ontological structuralism may seem abruptly at odds 
with realism, let alone naturalism. It would seem even to undermine 
the ground on which I rested it: my talk of impacts of light rays and 
molecules on nerve endings. Are these rays, molecules, and nerve 
endings themselves not disqualified now as mere figments of an 
empty structure? 

Naturalism itself is what saves the situation. Naturalism looks only 
to natural science, however fallible, for an account of what there is 
and what what there is does. Science ventures its tentative answers in 
man-made concepts, perforce, couched in man-made language, but 
we can ask no better. The very notion of object, or of one and many, 
is indeed as parochially human as the parts of speech; to ask what 
reality is really like, however, apart from human categories, is self- 
stultifying. It is like asking how long the Nile really is, apart from 
parochial matters of miles or meters. Positivists were right in brand- 
ing such metaphysics as meaningless. 

But early positivists were wrong if and when they concluded that 
the world is not really composed of atoms or whatever. The world is 
as natural science says it is, insofar as natural science is right; and 
our judgment as to whether it is right, tentative always, is answerable 
to the experimental testing of predictions. 

We saw that reference could be wildly reinterpreted without vio- 
lence to evidence. We see now that that is just part of a wider pic- 
ture. Presumably yet more extravagant departures, resistant even to 
sentence-by-sentence interpretation into our own science, could 
conform equally well to all possible observations. If we were to en- 
counter such a case and somehow recognize it as such, we might 
even proceed to master it and then switch back and forth for richer 
perspectives on reality. But naturalism would still counsel us that 
reality is to be grasped only through a man-made conceptual 
scheme, albeit any of various. 

My global structuralism should not, therefore, be seen as a struc- 
turalist ontology. To see it thus would be to rise above naturalism 
and revert to the sin of transcendental metaphysics. My tentative 
ontology continues to consist of quarks and their compounds, also 
classes of such things, classes of such classes, and so on, pending 
evidence to the contrary. My global structuralism is a naturalistic 
thesis about the mundane human activity, within oyr world of 
quarks, of devising theories of quarks and the like in the light of 
physical impacts on our physical surfaces. 

W. V. QUINE 
Harvard University 
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THE UTILITY OF PLEASURE IS A PAIN 
FOR DECISION THEORY* 


VERYBODY agrees that we have intrinsic desires. Obvi- 

ously, we have desires; many of them are derivative; so, in a 

kind of analogy to a deductive theory that divides its prov- 
able statements into axioms and theorems, some of them must 
be basic. 

There is much less agreement about what our intrinsic desires are 
or should be. Still, no one doubts that we intrinsically desire, among 

‘other things, happiness, pleasure, and the absence of pain, certain 
qualities of our sensations, feelings, moods, and psychological states 
in general. We desire these things, but without having, or feeling the 
need to have, a justification for doing so. 

Desires come in various strengths, and, together with more or less 
firm beliefs, they guide rational action. The formal counterparts of 
these concepts in decision theory are utility and probability. The 
distinction between intrinsic and extrinsic desires is also reflected in 
decision theory, where each model contains a utility function that is 
basic (at least relative to that model) and an expected utility function 
that is derived from the basic utility function and the subjective 
probability function of the model.’ On the whole, decision theory 
certainly provides the best formal explication of our intuitive rea- 
soning about desires and beliefs in practical deliberation. 

A person’s intrinsically desired psychological states should thus be 
in the domain of her utility function. But then, we claim, decision 
theory fails. More specifically, we shall defend two theses: 


(a) If a decision situation exhibits a certain causal structure, then 


* We gratefully acknowledge the opportunities to discuss this paper at the Freie 


Nelson, Rainer Trapp, and Ernst Tugendhat. 

ported by the Forsch i Grant No. Sp 279/2-1. 
1 In the classical theory Sone. The Foundations of Statistics (New 

York: Wiley, 1954, 2nd ed. 1972), 

and actions expected utilities. In the standard framework of decision trees 


Press, 1965, 2nd 1983), , basic 
aand only compre Case ANSE eae PEP 
consisting of more than one possible world have expected utilities. 
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decision theory is in trouble, because the derivation of expected utili- 
ties fails. 

(b) This causal structure in fact obtains in a specific, but very com- 
mon kind of situation, namely, when the intrinsically evaluated psycho- 
logical states are in the domain of the utility function. 


Section I explains thesis (a), section II defends thesis (b). Since the 
trouble we have concocted may seem to rest somehow on a misun- 
derstanding, section III addresses this suspicion. We do not have a 
solution to the problem presented below and, indeed, we are not at 
all sure in what such a solution would consist. Nevertheless, in the 
final section some constructive conctusions will be adduced. 

We shall talk about practical deliberation and decision theory only 
in an informal or semiformal way, because we shall discuss a general 
problem that is not produced by any one of the existing formal 
versions of decision theory. Our considerations are most easily for- 
malized in terms of decision trees or decision-flow diagrams, how- 
ever, as they are presented, in an exemplary way, in Raiffa (op. cit.). 
Thus, this theory should be used as a formal guideline, if required, 
and even though our informal intentions are broader, our claims 
about formal decision theory refer only to it. 

It will be apparent that the problem is but a variant of Joseph 
Butler’s? criticism of hedonism. Thus, in a sense, the point of our 
paper is that modern theorizing about practical deliberation has not 
dealt seriously with Butler’s criticism. 

I. A CAUSAL ASSUMPTION IN DECISION THEORY 
Thesis (a) claims that decision theory presupposes a hidden assump- 
tion about the causal structure of decision situations.® To uncover it, 
let us look at how we proceed in practical deliberation. 

The task is to find an action that optimally promotes our aims. For 
this purpose, we list our aims, i.e., the propositions we intrinsically 
desire to be true, and the acts open to us; each aim has a utility and, 
for each act, a probability of turning out true conditional on that 
act. Since each act thereby gets an expected utility, we know which 
acts have maximal expected utility, i.e., optimally promote our aims. 

The calculation of the expected utilities of acts seems to proceed 

? Fifteen Sermons Preached at the Rolls Chapel, 1729, in W. R. Matthews, 
ed., Butler's Sermons and Dissertation on Virtue (London: Bell, 1949), Preface 
$4 29-31, and Sermon XI, §§ 1-10. Cf. also H. Sidgwick, The Methods of Ethtcs 
(New York: Macmillan, 7th ed. 1962), who deals with this criticism to some extent 
in book I, ch, iv, and in book II, ch. iti, § 5. 

> There is, in fact, at least one other assumption of this kind. Newcomb’s prob- 
lem has given rise to differing versions of decision theory which tonverge only 
when some conditions are assumed to hold. Cf., e.g., the papers in R. 


Campbell L. Sowden, eds., Paradoxes of Rationality and Cooperation (Van 
couver. British Columbia UP, 1985). 
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in one step. But usually it does not; usually, deliberation refers to a 
richer causal picture of the world that draws on the various ways and 
means by which our acts may influence our aims. This is reflected in 
a slightly fuller description of deliberation. ; 

The task is to determine a total utility for each act and indeed, as 
we shall immediately see, for each proposition in consideration. A 
very natural assumption—which cannot be adopted as a definition 
because, as will soon emerge, it is false—is that the total utility U‘(A) 
of a proposition A is simply the sum of its intrinsic utility U'(A) and 
its extrinsic utility U*(A).* Note that a proposition may have both: 
for some people, money not only makes for a comfortable life, but 
also acquires an intrinsic magic; driving is fun, but one also runs a 
risk of getting killed by it. 

So deliberation starts from the agent’s intrinsic utilities of all the 
propositions in consideration (many of them will be zero, of course). 
Then, by a kind of recursive procedure, the extrinsic and total utili- 
ties of each proposition are determined. The extrinsic utility of a 
proposition A is an expected utility; it is the weighted sum of the 
total utilities of the possible direct causal consequences of A, the 
weights being the subjective probabilities of these consequences 
conditional on A. By rolling his causal picture backward from its end 
points, the agent can thus successively calculate an extrinsic utility 
and, with the assumption just mentioned, a total utility for each 
proposition, until he has reached the starting points of his causal 
picture, i.e., his acts. He thereby finally determines which acts have 
maximal total utility. 

However this sketch of practical deliberation is precisely formal- 
ized,’ its essential point is the conception of desires or utilities and 
the derivation of them it embodies. There are only intrinsic and 
extrinsic utilities and sums of them; the intrinsic ones are underived, 
the extrinsic ones derived, and the derivation proceeds backward 
from supposed effects and their conditional probabilities and total 
utilities to supposed causes and their extrinsic utilities. 


4 Of course, each utility function always is that of some person at some time, 
even though we do not make this explicit in the notation 

5 The sketch well agrees with common versions of decision theory. The simple 
one-step procedure is embodied in Savage's initial representation of decision situ- 
ations in terms of states of the worlds, acts, and consequences (ch. 2). The more 
detailed multistep procedure is formalized in decision trees and their analysis by 
backward induction (see, e.g., Raiffa, ch. 2) which is, the way, already con- 
tamed in Savage’s more elaborate theory of small worlds (sect. 5.5); our consider- 
ations are ahgned with this standard formalization. The causal character of practi- 
cal deliberation 1s, of course, accounted for in all of causal decision theory which, 
admittedly, is an elaboration of Savage's theory; cf., e.g., Brian Skyrms, Prag- 
matics and Emptricism (New Haven: Yale, 1984), ch 4. 
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Talking thus of derivation may suggest that we conceive of deci- 
sion theory as a syntactic theory about how rationally to compute 
utilities (and probabilities). This is not so. Decision theory should 
rather be conceived as stating laws about how utilities and probabili- 
ties coexist in rational complexes of graded beliefs and desires. 
What we mean, then, by using the more graphic phrase that some 
part of such a complex is derivable from other parts is that the 
former is uniquely determined by the latter according to the deci- 
sion-theoretic laws. 

Now, if practical deliberation is to work as described, some as- 
sumptions of causal well-orderedness have to be satisfied. One natu- 
ral assumption is that the agent’s complex of beliefs and desires 
entering his practical deliberation is a sufficient cause of his actual 
behavior. This assumption is, per definition, true of all rational 
agents.’ And it is presupposed by practical delibération in the sense 
that the weaker the impact of the deliberation on actual behavior, 
the emptier the deliberation itself. The anomalies that arise with 
violations of this assumption are well-discussed under the heading 
“weakness of will.”” 

Another assumption, and the one with which we are here con- 
cerned, is that the output of each step of deliberation is not already 
needed as input of that or an earlier step; otherwise, deliberation 
would obviously be caught in a circle. The output of deliberation 
consists in extrinsic and total utilities. How could they possibly be 
needed as input? When and only when the having of these extrinsic 
and total utilities is causally relevant to those effects from the utili- 
ties of which these extrinsic and total utilities are to be derived. 

Let us look at a simple abstract instantiation of this seemingly 
weird possibility. Suppose that the agent’s total utilities of B and 
non-B are already determined and that A is the only proposition 
considered by her to be directly causally relevant to B and non-B. 
Then the derivation of the extrinsic and the total utility of A is 
straightforward: 

(1) U*(A) = UB) P(B| A) + U'(non-B)P(non-B | A) 

(2) U'(A) = UA) + U*(A) 

Now grant, for the sake of argument, that in the agent’s view B 
causally depends also on the total utility she actually assigns to A. In 
this case, the probability P(B| A) in (1) will not do; the deliberation 
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must rather use the probabilities P(B|A and U'(A) = x),® for any 
possible value x of U*(A). But then the decision-theoretic account of 
practical deliberation obviously gets circular; in order to derive the 
value of U*(A) according to the modified (1), one should know from 
which. value of U‘(A) to proceed, which according to (2), however, 
depends on the value U*(A) takes. Hence, for this account to work 
we have to assume the absence of such a vicious causal dependence. 

We will claim, of course, that this dependence is neither weird nor 
impossible. But first the argument just given requires four com- 
ments on the abstract level. 

First, the argument implicitly contains the assumption that what is 
causally dependent in the agent’s view is also stochastically depen- 
dent according to her subjective probabilities. We believe this as- 
sumption to be defensible; but here is not the place to argue this 
intricate point.’ 

Second, the argument rests on the claim that the probabilities 
P(B|A and U‘(A) = x) must be used because P(B|A) is unavailable. 
But this seems false. Why not make P(B| A) available by computing 
the sum (or the integral) of all P(B|A and U*(A) = x) X P(U"(A) 
= x| A) for all x? The idea of having a probability distribution for 
one’s own total utility of A (this is what P(U'(A) = x| A) amounts to) 
looks a bit strange, however. More importantly, the idea is self-de- 
feating, because that distribution would allow one to calculate first 
P(B\A) and then, by (1) and (2), U‘(A) and thus to gain certainty 
about U"(A). Of course, this self-defeat just reflects the circularity of 
the situation imagined. 

The only way to escape this self-defeat would be to start for sure 
from such a value of U‘(A) that will be confirmed by these calcula- 
tions. This is the solution we shall suggest and more fully explain in 
section IV below. But note that this cannot be called a way of deriv- 
` ing U!(A) because there need not be exactly one such self-confirm- 
ing value of U‘(A). And even if there is exactly one such value, the 
procedure results in a determination of U*(A) that is very different 
from the one given by (1) and (2) in the absence of the vicious causal 
dependence. Indeed, in order to grasp fully how drastically the stan- 
dard decision-theoretic picture is changed by adopting that solution, 
we first need to understand how that circular situation might come 
about (see section II) and why one cannot get rid of it within that 
standard picture (see section III). 

Third, there is another tacit assumption in (1) and (2), namely, 
i 3 Here, ‘U'CA) = x’ stands for the proposition that the agent’s total utility of A 
” For our view on the relation between causality and probability, cf. S 


pohn, 
Grundlagen der Entschsidungstheorts (Kronberg/Ts.: or, 1978), sect. 3.3 
and 5.1; and “Direct and Indirect Causes,” Topot, Tx (1990): 125-45. 
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that the intrinsic utility of A is independent of all the intrinsic utili- 
ties entering into the extrinsic utility of A. But this assumption may 
be dropped. If we allow for utility dependencies, we have to suppose 
the total utility of A-and-B and of A-and-non-B to be already given 
(either because the total utilities of these propositions are identical 
to their intrinsic ones, or because some steps of backward induction 
have already been performed). Working now one step backward in 
the decision tree, we obtain the total utility of A alone according to 
the formula: 


(1') UA) = U'(A and B)P(B|.A) + UA and non-B)P(non-B| A) 


But grant again that B causally depends also on the total utility the 
agent actually assigns to A. Then (1’) gets into the very same trouble 
as do (1) and (2), for the very same reasons. Therefore, we gain some 
perspicuity and lose nothing by sticking to the mentioned assump- 
tion of utility independence; our problem about deriving extrinsic 
and total utilities hides in the probabilities and not in possible utility 
dependencies. 

Finally, we have to be a bit more careful in specifying which causal 
assumption is required for (1) and (2) to work properly without 
circularity; for one might argue that the vicious causal structure is 
not in the least surprising. Trivially, the utility function one has is 
causally relevant to the act one performs’® and thus also to every- 
thing affected by this act. But in our abstract example, for instance, 
this fact alone would not force practical deliberation to use the 
probabilities P(B| A and U*(A) = x) instead of P(B| A). If the causal 
influence of the actual value of U‘(A) on B is screened off by A," 
then these probabilities are the same for all x, and we need consider 
only P(B| A). Thus, deliberation gets circular in this example only if 
the influence of the value of U'(A) on B is not screened off by A. 

The causal assumption decision theory presupposes is therefore 
this: if the values of the extrinsic and total utilities of the proposi- 


tions A,,..., A, are at all causally relevant to the propositions B,, 
..., B, from the utilities of which the utilities of A,,..., A, are to 
be derived, then this influence is screened off by A;,..., Ám them- 


selves. In each case in which this assumption is violated, the deci- 
sion-theoretic derivation of expected or, in our terms, total utilities 
is caught in a vicious circle. 

This consideration also makes it clear why this assumption is 
usually satisfied and why it may well have gone unnoticed. Our be- 


'° Thus is guaranteed by the above-mentioned assumption about the efficacy of 
deliberation. 
"' This means that, for all x, B is probabilistically independent of the proposi- 
tion U‘(A) = x conditional on A as well as on non-A 


16 THE JOURNAL OF PHILOSOPHY 


liefs and desires certainly have a tremendous influence on the exter- 
nal world, but this influence is exclusively mediated and thus 
screened off by our acts.! Hence, the derivation of expected or 
total utilities of acts works smoothly as long as the propositions from 
the utilities of which the derivation starts refer only to the external 
world; and this is certainly true of most applications actually made 
of decision theory. 
II. A VIOLATION OF THIS ASSUMPTION 

Thesis (b) claims that this strange kind of causal situation which 
jumbles practical deliberation has a very common instantiation. The 
previous paragraph shows where to look for it; we have to consider 
our own inner states, in which way they are desired by us and in 
which way they are caused. 

There are certainly many inner states having extrinsic utility for 
us; a noticeable example is given by beliefs that often have only 
instrumental value, e.g., for better informed decisions. But, more 
importantly, we also find a wide variety of sensations, emotions, 
moods, mentalities, basic frames of mind, and other inner states 
having intrinsic utility for us;’* their desirability is of great practical 
impact and is not derived from other desires in any recognizable 
way. These states are classified by the occasions that produce them, 
by the ways in which they are expressed, by their very rich inner 
phenomenology, of course, and even by their desirability (unpleas- 
ant feelings, e.g., have to be undesired). Intrinsically desired inner 
states will be called satisfactive states. 

How are satisfactive states caused? This is often opaque, as we all 
well know. But some general statements seem unassailable. Natu- 
rally, our satisfactive states are in the permanent causal grip of the 
external world. And naturally, the influence of the external world 
on us also depends on our inner condition it meets. For example, 
one usually enjoys an invitation with friends, but the more so, the 
less is one’s stress. The crucial question is: What are the relevant 
parts of that inner condition? And the crucial answer is: among 
other things, how much we desire the external situations. That is, 
the inner state a person gets into is produced by the external situa- 
tion one experiences and by one’s prior inner state that includes her 
prior desire for that external situation often as a relevant part. 

It is not difficult to find examples. Two men are watching Super 


EEE k EAER TONED DE DE AEE T Uk not 
only in intentional action, but also ın unintentional behavior (this is most 
beneficial to our transparency). But as long as we conmder ourselves only as 
agents, as decision theory does and as we here do, we may stick to this exaggera- 
tion. 

13 This is not to exclude that these states may have extrinsic utility besides; but 
nothmg of what follows turns on this possbility. 
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Bowl XXII on television. The Broncos have just scored the first 
touchdown. One man exults, the other gets nervous. Why? Because 
the one wants the Broncos to win, the other the Redskins; thus a 
touchdown by the Broncos has positive utility for the one and nega- 
tive utility for the other. 

Two years ago, a woman got pregnant, and she was miserable. 
Now she is pregnant again and very happy. Why? Because the first 
time she did not want to have a baby, but now she wants it. This is a 
causal explanation. First, there was the desire to have a baby, then 
she got pregnant, and now she feels happy; but if the desire had 
been absent, the pregnancy might not have made her happy, as was 
the case two years ago. 

Prior utility and posterior satisfaction need not parallel each 
other, however. All kinds of relations are possible. The importance 
of an aim may make me pursue it so grimly that in the end J cannot 
enjoy the fruits of my efforts. Your badly wanting some kind of 
thing you are deprived of may work either way: you may not get 
enough and end up in total frustration, or you may humbly enjoy 
small amounts of it. There are things or situations that satisfy one 
only when one does not try to bring them about, or even when one 
does not desire them.’* 

All these examples exhibit the causal structure that the previous 
section argued to be problematic: 

S: a sattefactive state of 


S S the person X 
NS N A: an extemal state of 
A A affairs 
t H: anactof xX 
H U: X's Inner state of 
Pa having a specific total 
U U utffty function that has 
inteme! extemal itema! extemal S, A, and H among Its 
world of X world of X world of X world of X arguments 
— means: ls causally 
Figure 1 Figure 2 relevant to 


Figure 1 shows the bare essentials of this structure: they are realized 
in the football example (with A being the fact that the Broncos have 
just scored and S the exultation or, respectively, the depression). 
There is one causal chain running from the external fact A (via 
television, etc.) to the inner state S. And there is a second causal 
chain running from U to S which never leaves X’s internal world. Its 


1 Cf. Jon Elster, Sour Grapes (New York: Cambridge, 1983), sect. II.2, for a 
discussion of this kind of paradoxical situations. 


18 THE JOURNAL OF PHILOSOPHY 


details are not shown in the figure; but it is this chain which provides 
the psychological basis for how X takes up the outside happenings, 
and U contributes to this chain. There is no mention of any acts 
being contemplated in the example or in figure 1. But the essential 
point is already contained in this case, namely, that there is a second 
causal chain giving rise to the derivational circle unfolded in the 
previous section; hence, the extrinsic and thus the total utility of A 
cannot be derived from the intrinsic utility of S. 

Figure 2 explicitly considers an act and fits the pregnancy exam- 
ple (with A being the woman’s state of pregnancy, S her happiness, 
and H, e.g., the act of stopping the use of a contraceptive). Now, 
there are indeed two causal chains running from U to S, the first via 
H and A, and the second as before. As stated in the previous section, 
the first chain poses no problems because the influence of U me- 
diated by this chain is screened off by H; therefore, the extrinsic 
utility of H, e.g., can be derived from the total utility of A. It is again 
the second chain not screened off by the external world which pre- 
vents the total utility of A, and thus that of H, from being derivable 
from that of S. 

Intuitively, deliberation does not get off of the ground at all if the 
only desire it is based on is the desire to be happy. If you are trying 
to decide which profession to choose, for instance, you cannot 
arrive at a conclusion simply by trying to figure out how satisfying 
the various options would be—not because the decision would be so 
complex, but because the options as such do not have a definite 
satisfactoriness for you. Rather, you first need to have at least a 
rough idea of how much you want to belong to the various profes- 
sions; only then can you reasonably ask how much they would satisfy 
you. This is essentially Butler’s criticism of hedonism over 250 years 
ago: 

The very idea of an interested pursuit necessarily presupposes particu- 

lar passions or appetites, since the very idea of interest or happiness 

consists in this, that an appetite or affection enjoys its object. It is not 
because we love ourselves that we find delight in such and such objects, 
but because we have particular affections towards them. Take away 
these affections, and you leave self-love absolutely nothing at all to 
employ itself about; no end or object for it to pursue, excepting only 

that of avoiding pain (op. cit., Preface, §31). 

We propose theses (a) and (b) as a translation of this into modern 
terms. If plausible, they point to an important incompleteness in 
decision theory. The gap does not open in each case in which our 
satisfactive states are part of the deliberation. Sometimes, the delib- 
erational circle is side-stepped by, say, overriding moral consider- 
ations; sometimes the influence of the causal chain within the per- 
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son’s internal world is negligible; and it would be most interesting to 
look at these exceptions more carefully. But in many ordinary cases 
the circle remains. 

III. HOW NOT TO DENY THE CIRCLE 
Although the present puzzle for practical deliberation may seem 
easily invented, there seems to be no easy way out. The following 
five suggestions, at least, will not do. 

(A) It may be thought that we have relied too heavily on the 
intuitive causal picture of practical deliberation and its formal repre- 
sentation in decision trees, and that the circle vanishes in some other 
suitable formalization of decision theory. But this idea does not 
seem plausible if the candidate formalization is one of the current 
versions of so-called causal decision theory because they have the 
same roots as those to which we have referred. One might refer 
instead to some kind of so-called evidential decision theory; but we 
are skeptical whether this helps. In any case, the burden of proof 
does not seem to lie with us. 

(B) One might think that we have simply offered an old refuta- 
tion of hedonism in a new disguise, that only the most stubborn still 
believe in hedonism, and that we are beating a dead horse. This, 
however, would be a misunderstanding; our problem is a more gen- 
eral one. We do not merely criticize the hedonistic claim that the 
only things intrinsically evaluated are satisfactive states. Rather, our 
concern is with the usual decision-theoretic explanation of what it 
means to derive extrinsic from intrinsic values; and for that purpose 
we needed only to assume that some of the things intrinsically evalu- 
ated are inner states of ourselves. Thus, everybody who adheres to 
this explanation is in trouble, not just the hedonist. 

Still, one might say that the problem emerged within a hedonistic 
venture, i.e., in the attempt to derive extrinsic utilities of external 
states of affairs from intrinsic utilities of satisfactive states. So why 
not simply abandon this attempt? But, surely, this is not yet an an- 
swer. The utilities of external states of affairs somehow relate to the 
intrinsic utilities of satisfactive states, even though this relation is 
not decision-theoretic derivation. We should not deny this relation, 
but try to account for it. 

(C) Another suggestion is that the puzzle somehow is generated 
by an ambiguous use of the term ‘utility’. In particular, one might 
think that, in the terms of figure 1, the utility of A that is causally 
relevant to S—let us call it U“(A), for the moment—is not the same 
as that utility of A that is to be derived from that of $. So what is 
U*(A)? Is it really U‘(A), as we have said? 

One might suggest that U*(A) is a kind of evaluation different 
from the kind used in practical deliberation. But it need not be, of 
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course. We do not want to deny that propositions (or other things) 
are evaluated by us in other, e.g., aesthetic respects and that these 
other kinds of evaluation may be causally relevant to our satisfactive 
states. But we insist that the decision-theoretic kind of evaluation 
used in practical deliberation has this causal relevance, too; and all 
the examples given confirm that claim. (Even in the football exam- 
ple, the desires of the television watchers actually are of no practical 
relevance because they cannot do anything for their favorite teams; 
but they would, if they could.) 

One might still object that there are now three alternatives for 
U*(A): U*(A) may be taken as the intrinsic, the extrinsic, or the total 
utility of A. U*(A) might indeed be U"(A); but then there is no delib- 
erational circle because U‘(A) is underived and available in advance. 
U*(A) cannot be U*(A); if it were and if we are right about the circle, 
there would not exist any derivable extrinsic utility of A. Finally, it 
seems that U*(A) cannot be U‘(A) either; U*(A) was assumed to be 
the sum of U"(A) and U*(A), and if U‘(A) does not exist, U*(A) does 
not either. Thus, the objection concludes, it seems to be an utter 
mystery which kind of utility should generate the circle. 

There is a formal answer. In anticipation of this objection, we 
have emphasized in section I that it is only an assumption, not a 
definition, that U*(A) is the sum of U"(A) and U*(A). If this assump- 
tion is dropped—as is forced by the circle—the last step of the 
objection fails, and the identification of U*(A) with U‘(A) can be 
preserved. 

Our examples provide also an intuitive answer. The woman’s hap- 
piness is influenced by how strongly she desires a baby; your satisfac- 
tion is influenced by how strongly you desire the profession you 
choose. In these cases, the desire affecting later satisfaction clearly is 
not (or not only) an intrinsic desire. In the woman’s case, it is her 
total desire for the baby which springs from many sources: the in- 
trinsic desire for the baby, the disutility of the various costs, the 
desire to please her husband or even her parents, etc., and, of 
course, also the desire for the satisfaction she derives from the baby. 
In the case of the profession, it is easily conceivable that the profes- 
sion is in no way intrinsically desired, but only meant to satisfy, to 
yield earnings, etc. 

Hence, the right conclusion from the objection is that there is 
something wrong not with our intuitive description of the circular 
cases, but with the usual classification of utilities into intrinsic, ex- 
trinsic, and total ones. The examples point to a kind of total utility 
which, though not intrinsic, cannot be decision theoretically derived 
in the presence of the circle. What we need is a more adequate 
classification of utilities. 
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(D) One might still feel, however, that the puzzle arises because it 
is not really clear what is meant by utility. The puzzle heavily relied 
on the causal wedge driven between utility and satisfaction; and this 
distinction may appear illegitimate. Indeed, many have tended some- 
how to equate the one with the other. We have to discuss separately 
the two ways of construing pronomial reference here; but it should 
be clear what we are up to: if you (1) reduce utility to satisfaction, 
then you do not do justice to utility; and if you (2) reduce satisfac- 
tion to utility, then you do not do justice to satisfaction. There is a 
way (3) of bridging the difference between utility and satisfaction 
with which we sympathize; but, as we will see, it does not resolve our 
deliberational circle either. Let us look at (1), (2), and (3) a bit more 
thoroughly. 

(D-1) The agent, as depicted in decision theory, is quite a strange 
individual. He is always very busy calculating utilities and trying to 
maximize expected utility, and he seems to be done when having 
done so. But does he ever get anything? What is his ultimate pay-off? 
In the simple case where money takes the role of utility, the answer 
is easy: what he gets is money. In the real case where utility must not 
be equated with money, it is natural to give a similar answer: what he 
gets is utility. Indeed, what sense is there in trying to maximize the 
expectation of a quantity one does not get? So far, however, utility is 
only a name for the agent’s ultimate pay-off. Now we know, of 
course, what his ultimate pay-off is: it is how he feels and how he is; 
in our terminology, it is his satisfactive states measured on some 
scale as conceived in the old calculus of pleasure. This is what 
utility is. . 

That is, briefly, the consideration that leads naturally to reducing 
utility to satisfaction; and many have been tempted by it.!5 But it 
does not yet give a complete account of utility. It only says that the 
agent’s future utility pay-off consists in his future satisfactive states. 
But how are we to understand his present utility function? There 
seems to be only one way: the present utilities of his possible future 
satisfactive states are just the degrees of satisfaction they would give 
him were they realized; and the present utilities of other states of 
affairs represent expected satisfaction, i.e., they are expected utili- 


13 This reduction is characteristic of classical utilitarianism, but it may also be 
found in modern welfare theory. Cf. Jeremy Bentham, An Introduction to the 
Principles of Morals and Legislation, . H. Burns and H. L. A Hart, eds. 
(London , 1970), ch. a S Utilitarianism, in Codlected Works, 
F. E. L. Priestley, ed. (Toronto: University Press, 1969), vol x, pp. 203-59; Sidg- 
wick, book II; and A. C. Pigou, The Economics of Welfare (New York: Macmillan, 
1920), pt. I, ch. II. See also A. K. Sen, “Plural Utility,” Proceedings of the Aristo- 
telian Society, LXXXI (1980/81): 193-215; and J. Griffin, Well-Being (New 
York: Oxford, 1986), ch. 1. 
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ties representing the supposed conduciveness of these states to fu- 
ture satisfaction. 

This conception is nevertheless inadequate. Its picture of practi- 
cal deliberation and rational agency contains only the agent’s possi- 
ble future satisfactions and his present beliefs about them, while 
totally dispensing with the agent’s desires or wants. We take it as 
obvious that such a picture is fundamentally distorted. 

One may try to improve this picture by locating the agent’s desires 
in it, i.e., by defining them within it. This, however, looks unpromis- 
ing from the beginning, because desires are widely taken as a basic 
kind of propositional attitude not reducible to other propositional 
attitudes.! Moreover, the only way of locating them seems to be to 
say that the agent’s present utilities, which have already been de- 
fined, simultaneously represent his present, more or less intense 
desires. But this move turns hedonism into an a priori truth; it 
stipulates in effect that the agent’s desires are intrinsically con- 
cerned only with his future satisfactive states. Moreover, the need of 
fitting this stipulation to the intuitive notion of desire entails a tradi- 
tional problem of hedonism, namely, that of having to relate each 
and every desire to the experience of satisfaction and each and every 
preference to a difference in (the expectation of) the experience of 
satisfaction. Again, all this seems distorted. 

Thus, the crucial point is, in a nutshell, that present desires may 
not be identified with present expectations about future satisfac- 
tions and that present utilities must represent the former and not 
the latter in order to play the role for practical deliberation and 
_ rational agency which they actually play in decision theory.’ 

Finally, it must be noticed that this conception of utilities does not 
make the deliberational circle disappear; it only transforms it. Our 
claim (b) then amounts to the claim that the agent’s present expecta- 
tion of future satisfaction is causally relevant to future satisfaction; 
and it is supported by precisely the same examples. It says, in 
simpler nonprobabilistic terms, that the agent’s present belief that 
this state of affairs will produce that future satisfaction is causally 
relevant to whether it actually is so. This kind of belief is, so to 
speak, self-verifying (or in some other way causally affects its own 
truth). And it generates an analogous kind of deliberational circle: 
to know which future satisfaction you should expect, you should 
know which future satisfaction you do expect. Hence, this concep- 
tion of utiligies is not only misguided, it is also unhelpful, and we 
should resist it. 

18 See also D. Lewis, “Desire as Belief,” Mind, xcvm (1988): 823-32, for a 


general argument why desire is not reducible to belief. 
17 This is more thoroughly argued in Kusser, sect 3.3. 
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(D-2) In fact, the history of the concept of utility in this century is 
shaped by such resistance, as is reflected, e.g., in the so-called theory 
of revealed preferences. Most thinkers, in particular economists, 
nowadays conceive utilities as representing various desires of vary- 
ing strength as they are revealed by the preferences and actions of 
rational persons. But then the question of the relation between utili- 
ties and satisfaction re-emerges. 

There has been a long-standing tendency to reverse the answer 
just discussed, i.e., to say that satisfaction is whatever we have when 
our desires come true and thus to turn satisfaction into a purely 
formal notion.’® Probably no one has put this so bluntly because it 
sounds so absurd. But the tendency to conceive satisfaction (or plea- 
sure or happiness) in this way has ample historical precedent. For 
example, you find Immanuel Kant”? saying: 

Glückseligkeit ist der Zustand cines vernünftigen Wesens in der Welt, 

dem es im Ganzen seiner Existenz alles nach Wunsch und Willen geht 

(sbid., p. 224). 

And John Rawls” saying: 

. we are happy when our rational plans are going well, our more 
important aims being fulfilled, and we are with reason quite sure that 
our good fortune will continue (ibid., p. 548). 

Nevertheless, we should resist any such tendency. We have got 
what we wanted too often without being satisfied; and the pheno- 
menology of what we have called satisfactive states is too vivid and 
too independent. So, no conceptual assimilation of satisfaction to 
realization of desires can be successful. 

There are, of course, reasons for this tendency. One reason cer- 
tainly is that it is at least the normal case that the realization of our 
desires satisfies us; but not every case is normal. A stronger reason is 
that in each case we have to assume that the realization of our 
desires will satisfy us. If we believe that some of our desires will not 
satisfy us and if we still stick to these desires, then we either have 
overriding, say, moral reasons for doing so, or we are plain foolish. 
The situation resembles G. E. Moore’s paradox of belief: we have to 
believe that what we believe is true; but others may know better, and 
we may learn to know better, too. Likewise, we have to believe that 
what we desire will satisfy us. But others may know better, and we 
may learn to know better, too. Such reasons may have furthered this 
tendency, but they do not justify it. 

™ This is nicely paralleled by a purely formal notion of fFeutility as some- 
thung you get or have whenever a deure of yours gets fullled. Cf Gniffin, ch. 1, 


19 Kritik der praktischen Vernunft, 1788. 
™ A Theory of Justice (Cambridge: Harvard, 1971). 


4 
24 THE JOURNAL OF PHILOSOPHY 


(D-3) Once we acknowledge that the relation between satisfac- 
tions and desires or utilities, i.e., the grounds for rational action, is 
contingent, the deliberational circle stands. There is some support 
for the view that, in some sense, our genuine or true desires are 
those which actually satisfy us and that we ideally have such de- 
sires.?! We share this view, and we will expand on it below. But it has 
no impact on the deliberational circle. This circle is generated by the 
relation between actual desires or utilities and satisfactions and does 
not vanish by considering how desires ideally are. 

(Œ) You might still feel uneasy about the theses leading to the 
circle. Perhaps you are tempted by the following objection. 
“Crudely put, your thesis (b) says that very often a person enjoys 
something because she wants it, and then you somehow extract a 
problem from this. But if you ask this person why she wants that 
thing, it would be natural for her to reply: ‘Because I enjoy it.’ And 
there is nothing wrong with that reply—except that you cannot 
accept both the reply and your thesis (b); you cannot have it 
both ways.” 

One can easily be misled by this consideration; indeed we have 
been. There is, however, no incoherency at all; the two ‘because’s in 
that objection simply have two different meanings. When we say that 
a person enjoys something because she wants it, we state one of the 
many causes of her enjoyment. When she says that she wants it 
because she enjoys it, she expresses one of the many practical rea- 
sons for her want. 

It is almost as simple as that, but not quite; two problems hide in 
this answer: 

(E-1) We have just admitted that the person’s belief that she will 
enjoy something is a practical reason for her desiring it. But what is 

_a practical reason? Is A not a practical reason for B just in case A is 
essentially used in a practical deliberation supporting B, i.e., in a 
decision-theoretic derivation leading to B? The problem is that the 
last sentence contains in fact two different explanations; the first is 
vague and correct, the second precise and incorrect. The conclusion 


21 Cf. Bertrand Russell, The Analysts of Mind (London: Allen & Unwin, 1921) 
ch. III, esp. pp. 72ff.; J. C. B. ing, Pleasure and Desire: The Case for 
Hedonism Reviewed (New York: Oxford, 1969), ch. 7; Ernst Tugendhat, Prob- 
leme der Ethik (Stuttgart’ Reclam, 1984), pee csp- pP. 45f.; and A. 
Tversky and D. Kahneman, “The Framing o. i and Psychol of 
Choice,” in Elster, ed., Rational Choice (New York: University Press, 1988), P- 
188. 

It would not*be correct to require of a rational person to have true desires; we 
also do not require of her to have true beliefs. Seeking, not havi know is 
essential to rationality (of course, the second often u the result of the first). Thus, 
it certainly belongs to rationality to strive after true desires. This sense of ranonal- 
ity may be intended in the above quotations of Kant and Rawis. 
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we have to face is that there are two kinds of practical reasons: the 
decision-theoretic kind and some other kind. This other kind is in- 
stantiated by the belief that one will enjoy something, but it still 
lacks an explanation. 

(E-2) The causalists in action theory, with whom we have obvi- 
ously sided, hold that, for a rational person at least, what is a practi- 
cal reason from the person's inside perspective turns into a cause 
from our outside perspective. Now, when we admit that the.person’s 
belief that she will enjoy something is a practical reason for her for 
desiring it (and think that she is rational), we also have to say that 
her belief causes her desire. And since we keep claiming that her 
desire is causally relevant to her actual enjoyment (and think that 
causal relevance is transitive), we have to conclude that her belief 
that she will enjoy something is causally relevant to its truth. Should 


we really be prepared to accept this conclusion? 
IV. HOW TO TAKE THE CIRCLE SERIOUSLY 


It seems that the circle stands unshaken, but each attempt to avoid it 
opens questions. If the circle is important, it presumably calls for a 
theory. We have none; but we want to suggest some constructive 
observations and some positive conclusions. 

We have hinted at the rudiments of a formal solution of our 
problem already in section I; let us spell them out by looking again 
at formulas (1) and (2). In these equations, the relation between A 
and B was assumed to be regular, so to speak. In the circular case, 
however, which can only obtain when B represents a future satisfac- 
tive state, the causal beliefs of the agent are expressed not by 
P(B] A), but rather by P(B| A and U"(A) = x), for any possible value 
x of U'(A). Therefore, the former should be replaced in (1) by the 
latter. Now, if U‘(A) were known to take the value x, the modified 
(1) and (2) would determine a value y of U'(A). Thus, if the value of 
U"(A) is x, it should be y; and this is satisfiable, only if x = y. In other 
words, only solutions of the equation 


(3) x = U'(A) + U'(B)P(B|A and UA) = x) + U'(non-B)P(non-B| A 
and U‘(A) = x). 


are eligible as values of U‘(A); otherwise, the total utility of A does 
not match the expected satisfaction A derives from B, given this 
total utility of A. 

The fact that the unknown x appears both at the left hand and the 
right hand side of (3) nicely reflects our deliberational circle. Note 
that no extrinsic utility is now assigned to A; when fotal utility is 
constrained by (3), it can no longer be assumed to be the sum of 
intrinsic and extrinsic utility. Equation (3) may have no, exactly one, 
or, as will be quite common, more than one solution, depending on 
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how the conditional probabilities in (3) vary with x. If the mathemati- 
cal problem were neatly formalized, one could presumably bring 
fixed-point theorems of equilibrium theory into action. We have not 
attempted to do so, for reasons stated below. 

Nevertheless, the equilibrium idea embodied in (3) is intuitively 
very helpful; it informally clarifies six of the questions we have 
left open. 

Four kinds of utility. Equation (3) specifies the relation between 
the utilities of satisfactive states and those of external states of af- 
fairs which was sought in (B) of the previous section. This relation 
complements the decision-theoretic picture of practical rationality. 
According to this picture, there are only two kinds of utility: utilities 
figuring as derivational premises, i.e., the intrinsic ones, and derived 
utilities, i.e., the extrinsic and total ones. In (3), however, utilities 
have different roles. It would be inappropriate to say that in (3) 
U!(A) is derived from U*(B) or from U*(B).” This is particularly 
obvious in the case where (3) has more than one solution. Thus, the 
intrinsic utilities of satisfactive states like B do not in general func- 
tion as derivational premises. Correspondingly, the utilities of many 
external states of affairs like A are neither derived nor intrinsic in 
the original sense. Rather, the relation between U'(B), or U*(B), and 
U*(A) in (3) is more appropriately described as one of control. The 
utilities of external states of affairs are controlled by those of satis- 
factive states in the sense that the former have to be in equilibrium 
with the latter and have to be changed whenever they are out of 
equilibrium.” Utilities may also be subject to such an equilibrium 
control—this is the essential lesson to be drawn from our delibera- 
tional circle. 

In this sense, (3) generates two new kinds of utility: controlling 
and controlled utilities. This also clears up the muddle in (C) about 
which kind of utility of A is causally relevant to satisfaction. It is the 
underived total utility of A—where ‘underived’ does not mean “in- 
trinsic” in the old sense, and ‘total’ does not mean “derived.” There 
is nothing paradoxical in this assertion, once the four kinds of utility 
have been recognized. 

Two kinds of practical reasons. What we have just said also an- 
swers the question in (E-1) of the previous section. There are indeed 
two kinds of practical reasons: those functioning as premises in a 

2 UB) and U"(B) will usually be equal if B is a satisfactive state, as it is in (3). 

Indeed, thg relation is less one-sided than the word ‘control’ ts One 
may re-establish equilibrium also by re-evaluating satisfactive states. Thus, when 
you experience that the fulfillment of a certain desire does not satisfy you, you 
may also try to learn to draw satisfaction from it, e.g., by redirecting your aware- 
ness, by depreciating alternative satisfactions, etc. Indeed, this method of accom- 


modating to desires (instead of desires to pleasures) may often be ob- 
served. It should not be dismissed as irranonal. 
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decision-theoretic derivation and those supporting equilibria like 
solutions of (3). The desire for satisfaction and the belief that one 
will enjoy something—or, more generally, utilities and probabilities 
like the ones on the right hand side of (3)—are reasons of this 
second kind. Thus, if asked why one wants something, there is noth- 
ing wrong with the answer: “Because I enjoy it.” What would in 
general be wrong is the assumption that this want is simply derived 
from the desire to feel joy. 

The causalists’ claim that practical reasons viewed from inside are 
causes viewed from outside has to be specified accordingly. It seems 
clear that this claim can only be maintained for the decision-theo- 
retic kind of practical reasons for which it was intended. The practi- 
cal reasons of the equilibristic kind at best cause one to stick to the 
desire supported by them, but not to have it in the first place. We 
thereby can avoid the undesired conclusion of (E-2); but the next 
point looks into the matter more carefully. 

The status of the belief that the realization of a desire will sat- 
isfy. What we have just said means that this belief accompanies 
rather than precedes the desire. Indeed, it is a necessary companion 
of the desire. Unless one has other reasons for a desire, one would 
be irrational to have the desire and lack the belief that its realization 
will satisfy. This or rather the probabilistic counterpart thereof is 
what (3) asserts; and this agrees well with what we have observed 
in (D-2). 

What about the suspicion lurking in (E-2) and also at the end of 
(D-1) that this belief is not a normal empirical belief, but rather one 
of the self-verifying kind? We have to be careful here concerning the 
precise content of this belief. It may be (i): the realization of a 
desired proposition, say, A, will satisfy. Or it may be (ii): the realiza- 
tion of the proposition A as a desired one will satisfy. Anyone who 
believes (i) will probably also believe (ii); and anyone who believes (ii) 
and is conscious of his desire for A will certainly believe (i). Still 
there is a subtle difference. If one’s causal belief is expressed by (i) 
and one overlooks or denies that one’s desire for A is causally rele- 
vant to how satisfying A is, then one can maintain the received pic- 
ture—at the cost that the belief in (i) is in fact causally affecting its 
truth. If, however, one’s causal belief is expreased by (ii), one has to 
give up the received picture with respect to the utility of A—with 
the benefit that the belief in (ii) is a normal empirical belief. Only 
this second alternative can be sustained. The first is self-defeating; as 
soon as one realizes that one’s belief in (i) is affecting its own truth, 
one can no longer stick to it. 


Changes of the belief that the realization of a desire will satisfy- 


When we said that this belief necessarily accompanies the desire; this 
did not mean that this belief would merely be a by-product, of the 


a i 
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desire. On the contrary, under reading (ii) it is a normal belief to 
which normal epistemology applies. It may be grounded or un- 
grounded by evidence; it may be false; one may learn that it is false; 
and, to put it so generally as to cover also the probabilistic case, one 
may change it on the ground of new evidence according to general 
rules of rational belief change. But which consequence has such a 
change for the desires? Again, we have to distinguish two cases. 

The normal case is that belief change only calls for a recalculation 
of the expected utilities. This may even be true of changes of the 
beliefs about future satisfaction. For instance, I might decide in a 
restaurant to try an exotic dish that I have never tasted before. It 
turns out that I do not especially like it, and I accordingly revise my 
belief about the satisfying efficacy of that kind of dish and thus its 
expected utility. This case allows such a simple description, however, 
only when our circle is absent, i.e., when one’s desire for a dish does 
not affect how much one likes its taste. 

The case is very different when our deliberational circle is pres- 
ent. The change of the beliefs about future satisfaction, i.e., of the 
probabilities involved in (3), then results in a predicament: the con- 
trolling desire for satisfaction and the controlled desires for other 
things are out of equilibrium and can no longer be maintained ratio- 
nally. Somehow, they must be changed; somewhere, a new equilib- 
rium must be found; but there is no hint about how to change them 
and where to seek it. Everyone will know the experience of running 
on empty—e.g., when the thrill of rock ‘n’ roll fades or when one’s 
love is worn out—and the often difficult process of redirecting 
one’s desires. 

Equation (3) as a model of deliberation. Equations (1) and (2) 
provided rules of practical deliberation; they told us how to derive 
extrinsic and maximize total utilities. Equation (3), which replaces 
(1) and (2) in the cases discussed, might thus also be taken as offer- 
ing a model of deliberation, namely, this one: solve (3); if it has no 
solution, that is too bad (though this may be ruled out by suitable 
continuity assumptions); if it has a solution, fix U'(A) to be the 
maximal solution—because among all the eligible values of U‘(A) 
this is the one which gives you the highest expected satisfaction! If 
this were a feasible model of deliberation, then it should be helpful 
to work out the mathematics. But we doubt this very much, for two 
reasons. 

First, the relevant probabilities seem scarcely to be available; for 
knowing them means having an answer to all the questions of the 
following kind: “If I should desire A so and so strongly, how likely is 
it that the realization of A will satisfy me to such and such a degree?” 
But we simply do not know and thus cannot grade many of the 


THE UTILITY OF PLEASURE IS A PAIN FOR DECISION THEORY 29 


possible satisfactions. How does it feel to be a free climber? How 
satisfying would it be? We have not the slightest idea. And it is even 
more difficult to take into account the influence of the desire. How 
satisfying would it be to be a free climber, if one really wanted to be 
one? Thus, we can plausibly answer these questions only for the 
kinds of satisfactions we have experienced and for the kinds of de- 
sires we have pursued; all other probabilities would be artificial 
guesswork. 

Secondly, the formation of aims and ends, i.e., of underived de- 
sires is quite obscure; the idea that one could adopt, by sheer will, so 
to speak, the desires that such deliberation recommends, looks a bit 
strange. Thus, it seems doubtful whether such deliberation could 
really be as effective as it should be in order to have a point. 

Rational and true desires. If (3) should not be understood as a 
model of deliberation, what does it then accomplish? It provides a 
model of learning—at least in the weak sense that it says that de- 
sires in disequilibrium should rationally change and that this change 
should rationally move to some new equilibrium. 

Indeed, (3) establishes an extended notion of rational desire. In 
the received picture, there was no way at all of assessing intrinsic 
desires or utilities as rational or irrational; this had a precise mean- 
ing only for extrinsic desires. In our amended picture, however, (3) 
provides a standard of rationality also for controlled as well as for 
controlling desires (and thereby opens a possibility of grasping the 
notion of rational feelings). This is tantamount to saying that (3) 
provides extended means of criticizing the desires of a person, not 
according to moral or otherwise external standards, but strictly by 
that person’s own measures. 

Insofar as the beliefs that the realization of desires will satisfy may 
be true, or insofar as the probabilities in (3) may be the objective 
ones, (3) finally gives some meaning to the notion of true desires: 
our true desires are those which are in a maximal equilibrium given 
true beliefs. We rationally search for true desires; and since the 
causation of our satisfactions keeps changing, we have to search 
again and again. This is what we in fact do. We do not gain practical 
help from (3), but perhaps some theoretical insight into what we are 
seeking. 

ANNA KUSSER 
Freie Universität/Berlin 
WOLFGANG SPOHN 
Universität Bielefeld . 
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ALLOWING FOR UNDERSTANDINGS 


ET me begin with some history. In days gone by, the annual 

American Philosophical Association meetings were held at 

colleges and universities, and there was always a banquet, at 
which the president gave his address. One year at the end of the 
1950s, the APA met at Columbia. Being then young and just mar- 
ginal members, my friends and I were placed at the banquet in a far 
corner of the room. How marginal we were came out soon enough, 
while the others were eating, when we saw that we hadn’t been 
served. It turned out the kitchen had run out of food. We were least 
likely to make a fuss and so had been picked for doing without. 

I made the suggestion that we leave and get burgers and come 
back for the talk, but this found no takers. One person sitting at that 
table was loudly and firmly opposed to it. He said he had already 
spent $5.00 for dinner, and wouldn’t spend a penny more. I said 
that, if we left, we would each get a burger and pay just $2.00, while 
if we stayed we would starve. He replied that that was ridiculous, 
that if he did what I proposed, he would in the end have paid $7.00 
and only got a burger for it. He said he would rather starve. The 
others agreed, and so we stayed and ate celery. 

The reader may need some assuring: there is a philosophical issue 
in this. That issue is, I think, deep and important, and it connects 
with many others. Still, it has been almost wholly ignored. The rea- 
son may be that seeing it clearly calls for giving up certain convic- 
tions that are usually held without question. The issue can be fully 
grasped only in a proper framework of thought. I will discuss the 
framework first and leave the problem of the burgers for later. 

Consider this experiment by Daniel Kahneman and Amos 
Tversky.’ People were put in two situations. Each person in situation 
I was told to assume he had just got 1,000 (in Israeli pounds). He 
was then asked to choose between (A) getting another 500 for sure 
and (B) an even chance of getting another 1,000 or getting nothing 
more. In situation II, each subject was told to assume that he had 
been given 2,000 and was asked to choose between (C) losing 500 
and (D) an even chance of losing 1,000 or losing nothing. The major- 
ity of the people in I chose (A), and the majority of those in II chose 
(D). If they had been in both situations (first in one and then the 
other), many might have chosen both (A) and (D). 


1 “prospect Theory: An Analysis of Decision under Risk,” Econometrica, 
XLVII (1979): 263-91. 
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There seems to be something wrong, for suppose that the out- 
comes are reported in terms of the chooser’s final assets in them. 
Since getting 1,000 and then 500 more comes to the same as getting 
2,000 and losing 500, etc., the two situations both reduce to III, to 
having to choose between (E) winding up with 1,500 and (F) an even 
chance of winding up with 2,000 or 1,000. It seems that, if anyone 
chose (A) and (D), he faced the same issue twice, and the first time 
chose one option, the second time the other. 

How do Kahneman and Tversky account for the likely (A) plus (D) 
choices? They suggest that people, in choosing, focus not on total 
final positions but on the gains or losses their actions might generate 
for them: “The apparent neglect of a bonus that was common to 
both options [the 1,000 in one case and 2,000 in the other]. . . 
implies that the carriers of value or utility are changes of wealth, 
rather than final asset positions that include current wealth” (ibid., 
p. 273). Hence the priority assigned to the setups in I and II over 
that in III. They also suggest that the value curve for money is 
roughly S-shaped, that it goes through the origin, and is generally 
concave for gains and generally convex for losses. This last implies 
that where, as in situation I, a person must choose either a gain for 
sure or a chance of gaining and these options have the same money 
expectation, he is risk-averse, and that where, as in situation II, he 
must choose either a loss for sure or a chance of losing with the 
same money expectation, he is risk-inclined: hence the choices of 
both (A) and (D). 

Notice the similarity to the burger problem. In that, I held that, if 
we went out to eat, the outcome would be a burger for $2.00, a 
clearly positive gain, that the $5.00 we spent for the banquet was 
now long gone and irrelevant. My friend—let me call him Sidney— 
held that the local context matters, and that the past can’t be kept 
out of that. I reasoned as did the (A)-and-(D) choosers, whom 
Kahneman and Tversky thought foolish. That is, I reasoned as if I 
took “the carriers of value . . . [to be] changes of wealth... .” 
Sidney sided with Kahneman and Tversky; he thought in terms of 
“final asset positions.” I saw a gain in store if we left and only a small 
chance of dinner if we stayed. Thinking of the gains made me averse 
to the risk, and so I suggested we leave. Sidney noted that our final 
position if we left would be negative (counting what we already had 
spent, he saw it as a loss) and that we still had a chance of eating (of 
breaking even) if we stayed. This inclined him to take the risk and so 
he argued for staying. 

A second account is also possible of how all these choices were 
made, and Kahneman and Tversky brush shoulders with it. They 
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preface their report of their experiment by saying: “We now show 
how choices may be altered by varying the representation of out- 
comes” (ibid.). Here it isn’t a matter of outcomes versus final asset 
positions but of different presentations of the outcomes, of differ- 
ent reports given of them. And indeed the (A}and-(D) choosers, 
unlike the hypothetical (E) and (F) choosers, built the gains and 
losses they expected directly into their descriptions of the options. 
The authors make no allowance for such differences in their explan- 
atory “Prospect Theory,” nor do they think they can be used to 
rescue these people’s rationality, for they hold that a sensitivity to 
presentations marks a failing of its own. A person who changes as 
the presentations change (where he knows the presentations are 
coreportive) violates what they call “invariance.” Thus he departs 
from what they consider essential to common sense.” 

In several papers and in a book just published, I have taken the 
opposite view. The way people put the facts to themselves I have 
there called their understandings of them, how these people see 
those facts. I have held that taking note of understandings is essen- 
tial not only to explanations of actions but also to any judgment of 
whether some choice or action was rational. My case for the agent’s 
understandings being essential to an explanation of his conduct 
rests on some general theses. The first is that desires and utilities 
focus on propositions, that the “bearers of value” are not bare phys- 
ical facts but facts as the agent reports them. Also that any fact can 
be differently reported or propositionalized by the agent—differ- 
ently understood or seen by him. Also that propositions reporting 
the same are sometimes assigned different utilities by a person even 
where he knows they are coreportive, and that the way a person acts 
reflects the utilities (and probabilities) he assigns. 

Here our topic isn’t explanation but judgment, the judgment of 
rationality, but much the same can be said. My contention that a 
person’s understandings bear on judgments of the rationality of his 
conduct stems from my wanting a logic of action that is more open, 
more indulgent than the usual. And I want such a logic because I 
think that the (A)-and-(D) choosers made the right choices, given 
their understandings, though Kahneman and Tversky, given thetr 
understandings, would have been right to choose differently. Also 
that my stand on the burgers was sound, given my understandings, 


? See their ‘The Framing of Decisions and the Psychology of Choice,” Science, 
CCX] (1981). 453-8, and their ‘Rational Choice and the Framing of Decimons,” 
in Robin M. Hogarth and Melvin W. Reder, eds., Rational Choice: The Contrast 
between Economics and Psychology (Chicago: University Press, 1987), pp. 67-94 
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and that Sidney’s was too, given his. In other studies,” I have worked 
out such a logic, one that allows for each of the courses in (several 
of) Kahneman and Tversky’s experiments, and also in Allais’ prob- 
lem, in Newcomb-like problems, and in various others—a logic that 
approves different courses for people who see the same things dif- 
ferently. 

I won’t go into these matters now. What I want to do instead is to 
attend to the likely critics, to consider the sort of objections that 
might be brought against my whole project. These fall under three 
separate headings: ideological, formal, and visceral. Some comment 
is needed on each. 

The first sort will not keep us long, though the issue here is cen- 
tral. I want to arrange for a logic of action that allows for people’s 
understandings. This means that the logic of action I want has to be 
intensional. More strictly, it means that its theory of value has to be 
intensional, that the theory may not fault someone’s setting differ- 
ent utilities on propositions he thinks coreportive.* If propositions 
that were thought coreportive had to be given the same utility, it 
couldn’t matter which propositions reported the facts as the agent 
understood them. A person’s understandings couldn’t matter on 
their own. Many writers hold that they oughin’t to matter: proposi- 
tions that we think coreportive ought to be valued alike by us. These 
writers hold that a proper logic has to be extensional. 

The issue is central, but on this level, notbing much can be done 
with it; everyone just beats his own drum. Here are Kahneman and 
Tversky again, likening people’s judgments of value to their judg- 
ments of height. “Veridical perception requires that the perceived 
relative height of two neighboring mountains, say, should not re- 
verse with changes of vantage point. Similarly, rational choice re- 
quires that the preference between options should not reverse with 
changes of frame.” 

Kahneman and Tversky speak of preferences only, but they would 
say the same of utilities: rationality requires that the utilities we set 
do not change with our understandings. We face some confusion 
here, for the analogy of the mountains is poor. The measurement of 
height is real-object oriented; our “vantage points” or “framings” 
of an object are irrelevant to the height that it has. By contrast, the 


3 The logic is put most fully in my Understanding Action (New York: Cam- 
bridge, 1991). 

1 My use of “‘intensionality” is not the usual, but it retains the core idea of the 
nonsubstitutivity of certain related propositions; see Understanding Action, pp 
95-101 

5 “The Framing of Decisions and the Psychology of Choice,” p. 453. 
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measurement of utility is oriented to propositions—we evaluate 
propositions—and propositions are framings. So the thesis of inten- 
sionality is not what the mountains analogy suggests. It isn’t that 
different framings make for different valuations of the facts (the 
facts themselves don’t get valued), but that different framings of 
facts can have different values set on them, and this even where 
these framings are framings of what is thought the same fact. 

Let me beat that drum a bit more. The self-same fact can be 
variously framed: it can be variously propositionalized. Thus my ba- 
sic idea is not that the way we frame or understand things affects the 
utilities we set. Rather, the idea is that our understandings select the 
propositions the utilities of which will matter, and might even settle 
on propositions whose utilities are different from those of others we 
think are coreportive with them. I hold that, in picking those propo- 
sitions whose utilities will then figure for us, our understandings 
affect our choices. I also hold that this needn’t be faulted—that we 
needn’t insist on “invariance.” 

Again, that is the stand I take, but ought it to be endorsed? Ought 
a logic to allow for it? Isaac Levi,° speaking for the negative, says 
that ‘“‘[o]ne of the advantages of [a logic that doesn’t] is that its 
prescriptions are robust under transformations of the way possible 
outcomes are represented” (ibid., p. 342). This begs the question 
before us: Is it indeed an advantage? My claim is that it is not, no 
more so than it would be an advantage if the prescriptions of a logic 
were robust under transformations of the agent’s probabilities. The 
robustness of a logic is its ignoring (or abstracting away from) cer- 
tain kinds of variables, and such ignoring isn’t always a virtue. In 
particular, it isn’t one here. 

We turn now to some matters of logic. An ideological resistance 
to my not ignoring understandings need not stand in opposition 
alone. It may be supported by the contention that thinking like mine 
meets with formal problems, that it runs afoul of basic principles of 
utility theory. Here there are serious difficulties. 

Let u(h & k) be the utility a person sets on h-and-k. Let u(h, k) be 
the utility of h conditional on k, the agent’s degree of disposition to 
want A if, in addition to all else he believed, he now believed k. (This 
covers the case in which & is an item the agent already believes.) A 
familiar principle of conditional utilities is 

(1) ulh, k) = u(h & k) 

Suppose that*the agent thinks that A and n are coreportive. Then he 


*“A Note on Newcombmania,” this JOURNAL, LXXIX, 6 (June 1982): 337-42. 
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believes that A is true if and only if n is. Putting A-iff-n for k in (1), 
we get 
ulh, h-iff-n) = u(h & h-iff-n) = u(h & n) 
With h and n interchanged, we get 
u(n, n-iff-h) = u(n & n-iff-h) = u(n & h) = ulh & n) 
It follows that 
u(h, h-if-n) = u(n, n-iff-h) 
But since the agent believes h-iff-n and so also n-iff-h, this comes to 
u(h) = u(n) 


In sum, where a person thinks that A and n are coreportive, the 
utility he sets on A must be the same as that which he sets on n. Or 
rather, this follows from (1), the principle of conditional utilities. 
Principle 1 yields extensionality (invariance), just the constraint I 
want to avoid. I seem to be in some trouble here. 

Extensionality is also imposed by the usual principle of expected- 
ness. Let p(k, A) be the probability of k conditional on A. The usual 
principle is 

(2) ulh) = p(k, A)u(h & k) + pin, h)u(h & m) 


where p(k-&-m, h) = 0. In this, put A-iff-n for k and not-(h-iff-n) for 
m. Since the agent believes h-iff-n, p(h-iff-n, h) = 1, and so p(not- 
(h-iff-n), h) = 0 (both by the axioms of probability). Principle 2 thus 
says that u(h) = 1 X u(h & h-iff-n) + 0, which comes to 
u(h) = u(h & n). Doing this over with h and n interchanged, we find 
that u(n) = u(n & h). Here too it must be that u(h) = u(n). Inten- 
sionality is ruled out again. (The rulings-out are related, for (1) can 
be shown to follow from (2).) 

I think the message is this. The usual logic does indeed impose 
extensionality on us. It might be said that is part of its purpose, part 
of what it is for. Lf we want more freedom here—if we want to free 
ourselves to set different utilities on propositions we think corepor- 
tive—we need a less constraining logic. We have to weaken the usual 
principles in some suitable way. 

I will put that off for a moment, for there is also a second prob- 
lem. When I argued for getting a burger, I did this because I saw my 
getting it as I get a burger for $2.00. I knew of course I had spent 
$5.00 already and that the outcome thus might have betn put as J 
get just a burger, having spent $7.00. 1 preferred getting a burger 
for $2.00 to starving, though I preferred starving (that is, eating 
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celery) to getting just a burger when I had spent $7.00. Let I get a 
burger for $2.00 be h, and let I already spent $5.00 be k. Then I 
get a burger, having spent $7.00 is h-and-k. Since I believed k, it 
follows by (1) that I should have valued A and h-and-k alike, should 
have equivalued I get a burger for $2.00 and I get a burger, having 
spent $7.00. And so (1) rules out the preferences I had, for those 
bound me (by transitivity) to preferring h to h-and-k. 

The general point is that our usual logic implies a certain clump- 
ing. I will say it agglutinates: wherever a person believes any k, he 
must assign to every A the utility he assigns to h-and-k. Whether k 
enters his understanding of what h reports doesn’t matter. (Whether 
my already having spent $5.00 entered my understanding of getting 
the burger doesn’t matter.) A person in the first of Kahneman and 
Tversky’s situations should have valued I gain 500 and I wind up 
with 1,500 alike (the latter being equivalent to I gain 500 and I 
just got 1,000). A person in their second situation should have set 
the same utility on J lose 500 as on I wind up with 1,500. Likewise, 
a person should set the same utility on J find a dollar bill as on I 
find a dollar and then one day I die. 

Some people will bite the bullet and say that the utilities should be 
the same, even in the dollar-finding case. (Since we know we are 
mortal, our reaction to that should color all else.) I think that they 
might be the same, but that our logic should not demand it. So I 
think that the usual principles need to be weakened here too. Notice 
that agglutination is more basic than extensionality: only where they 
agglutinate must we agree that utilities are extensional. If we can 
block the former, we can be free of the latter. 

We want to weaken the principles so as to give due weight to 
people’s understandings. Here we will apply the concept of their 
conditional understandings—their dispositions to understand 
things if they believed, say, k. Let me say that a proposition that 
reports a fact the way the agent understands that fact is a compel- 
ling proposition for that agent (we might have also called it salient). 
We will say h is k-pre-emptive for him where h would be (or would 
remain) compelling for him if he believed k. We will say h is k-adop- 
tive for him where it isn’t k-pre-emptive and, moreover, if he be- 
lieved k, he would understand the fact A reports as the conjunction 
h-and-k. 

Back now to principles 1 and 2. We will add this proviso to (1), 
that A is k-adoptive: (1) will hold for us only with that proviso at- 
tached. Wé will add the proviso to (2) that A is both k-adoptive and 
m-adoptive. These provisos are being added to principles that pres- 
ent the utility of h in terms of the utility of that proposition con- 
joined with one or more others. They confine the force of these 
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principles to where that (or those) other proposition(s), if believed, 
would contribute to the agent’s understanding of what is reported 
by A. 

With the provisos in place, our logic no longer aggtutinates. No 
problem now with my having set different utilities on J get a burger 
for $2.00 and on I get a burger, having spent $7.00, for I get a 
burger for $2.00 was not I-already-spent-$5.00-adoptive. The 
former was pre-emptive for me; though I knew I had spent $5.00 
already, I still saw my getting that burger as I get a burger for 
$2.00. Likewise (I would guess) for the (A)-and-(D) choosers and 
also for the dollar-bill finder. To the average person, IJ find a dollar 
bill is one-day-I-die-pre-emptive; though he knows he will die one 
day, he still sees his finding a dollar as just J find a dollar. 

Extensionality is lifted too. Suppose again that the agent believes 
that h-if-and-only-if-k. The new (1) and (2) here get us u(h) = u(n) 
only in certain special cases, only where A is (h-iff-n)-adoptive and n 
is (n-iff-h)-adoptive. They get us u(h) = u(n) only where the fact 
reported by h is understood by the agent as h-and-n (that is, as 
h-and-(h-iff-n)) and the fact reported by n is understood as n-and- 
h. And so we get this only where the agent sees the fact(s) reported 
by h and by n the same way. That gives us all the sensitivity to 
understandings we need. 

The reader may want a basic principle less saddled with provisos 
than (2). If so, he might accept 


(3) u(h) = p(k, h)u(h, k) + pim, A)ulh, m) 


which holds given only that p(k-&-m, h) = 0. Nothing excessive fol- 
lows from (3). Where we believe k, we get just u(h) = u(h, k)— 
we are spared agglutination. Where we believe h-iff-n, we get just 
u(h) = u(h, A-iff-n) and u(n) = u(n, n-iff-h)—we are spared aggluti- 
nation and the consequent extensionality. Before we qualified (1) as 
we did, (3) and (2) were equivalent, but (3) is now the more general 
principle: it holds where (2) does not. Where h is k-adoptive and 
m-adoptive, (3) reduces to (2) via (1). Where it isn’t, (2) does not 
apply but (3) does apply. 

For some, this may be too much—here we arrive at the visceral 
part. Too many headaches, too many provisos, and now a new gen- 
eral principle! Thank you very much, but no! People oughtn’t to 
monkey with logic. The classical principles are simple and clear. 
They work well enough for our purpose. Where they conflict with 
judgments we make, it is those judgments which should be changed. 

The question is: Do they really work? I don’t think that they do. 
They are, I think, too constraining. They impose extensionality and 
agglutination. Thus they screen off distinctions that matter, distinc- 
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tions of how things are seen. That often makes for confusion, and it 
has left a mess of conundrums in decision theory. This of course has 
to be argued for, but I won’t do that here.” 

The basic negative gut reaction may be that the above is ad hoc. I 
noted what I took to be problems and tinkered with things until they 
were blocked. The critic may say this is too self-indulgent. He may 
insist that a settled logic has a claim to greater respect. Such a logic 
ought not to be changed to suit every contrary whim, no more so 
than we should weaken our ethics to cater to every temptation. 

Yes, of course, about whims and temptations, but what is a whim 
and what is not? And what is too much self-indulgence? The golden 
Tule of metaphilosophy is that of reflective equilibrium: “A [princi- 
ple] is amended if it yields a [judgment] we are unwilling to accept; a 
[judgment] is rejected if it violates a [principle] we are unwilling to 
amend.”*® I am unwilling to accept the judgments of extensionality 
and agglutination. The critic is unwilling to amend the familiar tra- 
ditional principles of utility. Here we must each of us go his own 
way, and I have gone by way of attaching provisos regarding under- 
standings. i 

The reader may make the objection that I haven’t said what un- 
derstandings are. That of course is correct. I haven’t said what be- 
liefs are either, or desires or preferences or probabilities or utilities. 
These are all basic mental states (the probabilities here are subjec- 
tive probabilities), states too basic to be defined in terms of anything 
else. The nature of beliefs and desires and the rest emerges only in a 
theory of mind, in what such a theory says about them. In part, it 
comes out in a theory of reasons, in how that theory says they com- 
bine to make for action. The same is true of understandings: their 
nature too comes out in such a theory. I have said briefly why I 
believe the explanation of action must take account of understand- 
ings. I will here have to leave it at that.’ 

We can now finally speak of the issue the burger problem raises. 
Should we have left to get burgers? The answer I gave to that was 
proper, given my understandings (and my beliefs and desires), and 
Sidney’s answer was too, given his. Logic—that is, my modified 
logic—faults neither him nor me. It endorses both our positions, 
given our different understandings. But what about those under- 
standings now? Was his more sound, more right than mine, as he 
implied by calling me ridiculous, or was mine more sound than his? 
What makes an understanding sound, and what makes an under- 


7I argue for it in Understanding Action. 

8 Nelson Goodman, Fact, Fiction, and Forecast, 4th ed., (Cambridge: Har- 
vard, 1983), p. 64 

° Again, I put my position fully in my Understanding Action 
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standing unsound? Or is there no such thing as soundness, and 
every understanding is as good as any other? 

Regarding the issue between Sidney and me, one might perhaps 
hope to find help in economics. For our issue was that of sunk costs, 
of whether money already spent should count in deciding between 
courses still open. Should the counting start from now or from some 
point in the past? Economists side with me on this question of 
whether to let bygones be bygones. Here is a typical statement: “[A 
sunk cost] should play no role (except as a help in forecasting. . .). 

. For any ensuing decision, only the escapable, or ‘variable’ coats 
are relevant.”!° But the point is taken for granted; there is no argu- 
ment offered for it. So it cannot settle the issue against someone 
(e.g., Sidney) who disagrees. 

Let us get beyond Sidney and me, beyond just the issue of sunk 
costs. Our two different understandings didn’t exhaust the possibili- 
ties. Someone else may have seen the outcome of leaving the ban- 
quet as we eat at the deli, or as simply we eat—no mention of costs 
in that. A vegetarian would have seen it as we resort to eating meat. 
All of these propositions would have been true had my suggestion 
been taken. The question is whether any (or all) of them would have 
been appropriate, whether any (or all) would have served to express 
a proper understanding of the outcome. Or better, the question we 
have is this, what decides such an issue? Are there standards of 
proper understanding, and if so, what are they? 

The question here is very broad. The understandings a person has 
enter his reasons for whatever he does. They enter because the 
components of his reasons—his beliefs, desires, utilities, and the 
like—focus on propositions reporting the facts as this person un- 
derstands them. Is his understanding in a situation improper? The 
reason that moves him must then be too: the beliefs and desires and 
the like he brings up are not then those he should have brought up. 
So the issue of the propriety of understandings seeps into that of the 
propriety of reasons and the question is hard to avoid. What stan- 
dards, if any, apply? What makes an understanding (and thus a rea- 
son) proper? When are we seeing things right? 

We look in vain for an answer to the classics. The few who remark 
on understandings at all try to shake off the question of their right- 
ness. Aristotle does it by definition. For him, an agent’s minor prem- 
ise reports his “grasp” or ‘“‘understanding”’ of some option, but also, 
understandings must be right; they wouldn’t be understandings if 
they were wrong: “Understanding is identical with goodness of un- 

10 Armen A. Alchian, “Cost,” International ee eee of the Socal 


Sciences (New York: Free Press, 1968); repr. in ed., Price 
Theory, 2nd ed (Harmondsworth: Penguin, 1980), p. eaha. 
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derstanding, men of understanding with men of good understand- 
ing.”’’’ Since not all people are in this elite, we need a word for how 
those not in it see the facts—‘misunderstand’ will do. This leaves us 
the question of how to distinguish understanding from misunder- 
standing, and that is no different from the question on which we are 
looking for help. 

Here is Kant on understanding, which he refers to as “‘judg- 
ment”: “judgment is a peculiar talent which can be practiced only, 
and cannot be taught.’’"* He also puts it differently, that the special 
talent is that of making right or proper judgments. He speaks of 
judgment as the application of rules—as the grasping or shaping of 
facts to which our laws (the rules) might apply—and he holds that 
“the power of righily employing them,” of rightly applying them, is 
a “natural gift” (ibid. ; emphasis added). An exercise of this natural 
power yields a right judgment by definition. (Kant is here joining 
with Aristotle.) The difficult problem is with us still. It is to distin- 
guish what is and what isn’t an exercise of this power. When is a 
person judging correctly? When is he seeing things right? On that 
we still have no clue. 

Among the moderns, one might look to Goodman, who studied a 
special case of this question (op. ctt., pp. 72 ff). A hundred marbles, 
all of them blue, have been drawn from an urn; the next to be drawn 
will very likely be blue. All the drawn marbles were also bleen, a 
bleen marble being one that is either drawn before tomorrow and is 
blue or is drawn after and is green. Parity of reasoning suggests that 
the next marble is likely to be bleen, but this says that if it is drawn 
tomorrow it is likely to be green and so not likely to be blue. For- 
mally identical inferences from the same evidence put in different 
ways give us contradictory conclusions. We avoid the contradiction 
by rejecting the bleen descriptions. But what is our warrant for 
rejecting them? What makes these descriptions improper? 

Goodman develops an answer in terms of the record in prior 
inferences of the basic words involved (of the words ‘blue’ and 
‘bleen’). Whether or not this approach of his works, its focus is very 
narrow. It deals with a special question only. It has to do just with 
what descriptions are proper for inferential purposes—not with 
right seeing in general, but with right seeing in inference only. The 
general question remains: When is a description, in any context, 
proper? When are we seeing or understanding things right? 

There are thinkers who reject this question, who think it is mis- 
guided. They deny that one can speak of the rights and wrongs of 
understandings except insofar as the judgments involved have to do 


1l Nicomachean Ethics, 1148a 
1? Criitque of Pure Reason, A188, B172. 
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with some suitability or coherence. They are content to ask how an 
understanding advances the agent’s purposes. Or they may ask how 
it squares with our own or with social custom, whether that under- 
standing is widely shared, or is the sort that most people have, or the 
sort accepted by people in some group to which the agent belongs. 
Right and wrong have no strict application: people whose seeings or 
understandings differ live in (have built) different worlds, and no 
judgment of propriety can be made of one world by those who live 
in another."® 

Similar stands have also been taken regarding desires and even 
beliefs, that there are no rights or wrongs there. We have the beliefs 
and desires we have, and if they cohere, no more can be said. There 
are no external constraints; everyone can suit himself. I resist this 
libertine view, though I have no good case against it. In the end, it 
may win by default, but in the meantime I put it off. I hold it is 
wrong to think the earth flat and wrong to want to push people 
around, and that these judgments of wrongness can be grounded in 
some impersonal, externalist way. 

So too with understandings. I hold that some seeings are wrong, 
that it is wrong to see marbles as bleen, even where they are—that 
even a marble we know to be bleen ought to be seen as blue or as 
green—and that this is more than mere preference. Also, on the 
political front, that it is wrong to see affirmative action (hiring) as a 
sort of discrimination, though in fact that is what it is, and that this is 
not just my policy bias dressed up to look more impressive. I hold it 
is wrong to see abortions as killings, though that is what they are, 
and wrong to see executions for murder as arrangements for getting 
even. But then I still face the question, what grounds do I have for 
saying all this? What makes it wrong to see these things so? When is a 
seeing right and when wrong? 

Having now asked this several times, I will let it go.’* I have no 
better handle on it than does anyone else. So we have made little 
progress. We began with some hungry philosophers around a table 
thirty years ago, and we wind up with empty bowls here again. Still, 
let me look at the bright side. I fancy that the question we have is a 
deep and difficult one. It is a question that calls for an answer and a 
question that has been ignored. So at least we have something to 
chew on, which is more than we had back then. 

FREDERIC SCHICK 
Rutgers University 

13 This last is Goodman's ; see his Ways of Worldmaking (Indianapolis: 
Hackett, 1978). See also Richard Rorty’s writings, esp. his Contingency, Irony, 
and Solidarity (New York: Cambndge, 1989). 

14 I consider some answers and why they fail in Understanding Action. Also in 
my “Under Which Descriptions?” in Amartya Sen and Bernard Williams, eds., 
Utslitananism and Beyond (New York: Cambridge, 1982). 
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COMMENTS AND CRITICISM 


SUPERVENIENCE/DETERMINATION A TWO-WAY STREET? 
YES, BUT ONE OF THE WAYS IS THE WRONG WAYI* 


N a recent article in this journal, Richard B. Miller! argued that 
I supervenience/determination relations are ‘“‘metaphysically in- 

significant” because they hold trivially and symmetrically be- 
tween domains of interest: e.g., not only does the physical trivially 
(sic) determine the mental, the moral, the aesthetic, etc., it is equally 
true that the mental, the moral, the aesthetic, etc., each determines 
the physical. The appearance to the contrary is an illusion resulting 
from not granting the gander (“‘higher-order” domains such as the 
mental, the moral, etc.) what we grant the goose (the physical do- 
main), namely, a language rich in its capacity to express fine and 
far-reaching distinctions, via mathematical, relational, and disposi- 
tional predicates. Extending similar expressive capacity to the 
higher-order languages results in the claimed symmetry—hence 
“metaphysical insignificance’”’—of supervenience. 

These claims are, in my view, without merit, and it is the purpose 
of this note to indicate why. This will also afford an opportunity to 
clarify certain aspects of supervenience/determination concepts rel- 
evant to their intelligent application, and to underscore how it is 
that they indeed mark major metaphysical, scientific distinctions and 
are worth pursuing. The key points are the following three. 

(1) The formulation of ‘‘physicalist supervenience/determina- 
tion” Miller works with is tendentious. It says, in effect, 


IPI: Any two [actual] things that differ in higher-order respect R 
differ [somehow] physically. 


The bracketed words highlight the two key problems with the for- 
mulation. IPI stands for “Identity of Physical Indiscernibles,”’ as it 
was dubbed by Frank W. Thompson and myself? (cf. formulas (2) 
and (3), tbid., p. 555). 

Now, it is easy to trivialize supervenience/determination with such 
a formulation, for indeed any two things do differ physically, on 
various constructions of what this means (with appropriate caveats 


ba I am indebted to Jaegwon Kim for helpful correspondence 

' “Supervenience Is a Two-way Street,” this JOURNAL, LXXXVII, 12 (December 
1990): 695—701. 

z “Physicalism: Ontology, Determination, and Reduction,” this JOURNAL, 
LXXI, 17 (October 2, 1975). 551-64 
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for quantum field-theoretic entities, whose status as “things” is in 
any case problematic). For example, of course, (classical) physical 
things differ at least in their spatiotemporal locations, and location 
properties are the subject of physics. 

Indeed, the supervenience literature contains many different for- 
mulations of determination relations (as Miller noted, 696, fn. 4). 
But all serious formulations have modal force.” Whatever the spe- 
cific machinery employed, the aim has been to capture intuitive 
theses such as, 


Were any two objects sufficiently similar in relevant physical respects 
(taking account of their environments), they would also share higher- 
order properties. 


In fact, the original formulation of “physical determination” (fn. 2 
above) was explicitly designed to improve upon IPI, which was the 
weakest formulation of “no difference without a physical differ- 
ence” considered: it does not even quite imply purely ontological 
physicalism, and was explicitly regarded as palpably too weak to 
capture the idea of “determination.” The principles articulated as 
genuine determination principles employed model-theoretic notions 
as a way of expressing modal force. Also, they were framed globally, 
building in no limits to the extent of environment that might be 
relevant. They were different ways of making precise the informal 
condition: 


Any two global (mathematically standard) models of scientific laws 

(av any two naturally possible worlds) indistinguishable in relevant 

physical respects, are indistinguishable in higher-order respects as well. 
Note that this calls for “interworld comparisons,” and it allows us to 
consider, for example, ‘‘counterparts” that share spatiotemporal 


properties. The fact that any two actual things have different loca- 
tions becomes entirely irrelevant. 


3 For a useful , see Paul Teller, “A Poor Man’s Guide to Supervenience 
and Determination,” Ghaukers Journal of Philosophy, xxu, Supplement (1984): 
137-62. See also Thompson and my op. cit., and our “Physicalist Materialism,” 
Not, x1, 4 (1977). 309-45; Kim, MCondepts of Supervenience,” Philosophy and 
Phenomenological Research, XLV (1984): 153—76; and John Haugeland, “‘Onto- 
ae Supervenience,” Southern Journal of Philosophy, xxO, Supplement 
(1984): 1-12. 

It is true that W. V. Quine—whom Miller ctes in addition to the above—can 
be read as resting with the nonmodal IPI as articulating his “physicalism.” But the 
supervenience/determination literature has been unanimous from the start in 
going beyond this, and, as indicated, Thompson and I explicitly mvestigated and 
rejected the IPI as a determination thesis. It is therefore ironic (to say the least) 
that Miller should seize on this very formulation as the basis for a critique of 
supervenience! 
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As Jaegwon Kim has suggested (in correspondence), however, the 
fact (if indeed it is one) that any two distinct (concrete) things in any 
nomologically possible world differ in some physical respect (even 
mere location) is hardly without metaphysical, conceptual interest. 
The standard involved here is just the necessitation of IPI above, 
and was designated ‘“‘weak supervenience”’ by Kim (op. cit.). Further- 
more, the corresponding claims of “identity of moral, or of psycho- 
logical, indiscernibles” (at least in their necessitated form) seem 
wholly implausible. 

Moreover, modal force must be combined with suitable restric- 
tions on the physical base. Not every relational property, for in- 
stance, even if expressed in physical language, counts as a relevant 
physical respect. In any real application of supervenience/determi- 
nation to physicalism or related theses, such restrictions are obvi- 
ously necessary and have been intended all along. Without them, the 
slightest difference in the physical ontology of two worlds would 
generate ‘physical distinctions” between counterparts (however in- 
discernible locally). For example, predicates of the form ‘coexists 
with. . .’, if admitted as physical, would trivially generate such irrel- 
evant distinctions. Again, the original formulation of “global physi- 
cal determination” was implicitly based on a list of physical predi- 
cates derived from actual scientific practice. Moreover, physical lo- 
cality can be used to motivate further severe restrictions.* Indeed, 
once this is recognized, even intraworld comparisons do not auto- 
matically lead to trivialization. (For example, absolute location may 
be deemed irrelevant in accordance with standardly adopted physi- 
cal symmetry principles; so may minor differences in various physi- 
cal magnitudes, and so forth.) Taking all this into account, it be- 
comes by no means trivial that, for instance, any two relevantly physi- 
cally similar persons must have the same mental experiences." 

Thus, the first (and, I think, decisive) step in seeing through 
Miller’s “‘goose-gander”’ argument is to realize that he has not con- 
sidered the genuine article: a live, robust goose, not a cooked one, 
and not a straw one either. 

(2) Concerning what the gander can do, Miller claims (700) to 
have considered “the toughest alleged counterexamples” to such 
reverse theses as ‘‘moral determination of the physical,” but in fact 
far tougher ones than any presented readily suggest themselves. So 


4 See “Determination and Logical Truth,” this JOURNAL, LXXXI, 11 (No- 
vember 1985): 607-16. 

5 As it stands, such a schematic principle is of course vague. Serious study of 
such a claim can only proceed by looking closely at the relevant sckences—in this 
case, the neurosciences and psychol: order to ascertain just what proper- 
ties are relevant as determinants of 
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long as the examples are restricted to a world with sentient life (not 
to say intelligent or human life, as in Miller’s own examples), a case 
could perhaps be made that any physical difference “‘makes a moral 
difference,” if we allow sufficiently far-flung dispositional predi- 
cates into the moral “‘base.’’ Consider, however, two worlds (or 
world stages) which differ physically but in which all forms of life are 
necessarily precluded (e.g., very early stages of the cosmos in a big- 
bang model in which matter as we know it cannot exist). (The physi- 
cal differences could be minor, or even major for that matter.*) 
What corresponding moral differences can there be between those 
worlds? By hypothesis, the possibility of moral agents (at any time in 
those worlds, if we like) is ruled out! It makes no difference to 
entertain what would be the case, were moral agents present, since 
the assumption of such agents involves a contradiction (with fixed 
background theory). Appeals to spiritualism that might occur at this 
juncture are quite beside the point. We may presuppose ontological 
physicalism, for the sake of argument, since it is a necessary, inde- 
pendent component of physicalism (with much going for it, one 
might add).” Implicitly, Miller must be granting this anyway, for, if 
spirits are allowed, obviously physicalist supervenience can readily 
fail, whereas Miller charges that it is instead trivially true. Similarly, 
we may insist that ‘moral’ means “moral by human standards” (or 
something perhaps more refined). The standards of other possible 
agents—possible spirits, God, what have you—are irrelevant. This 
much of our current usage must surely be held fixed! 

(3) The situation is even worse with regard to Miller’s claims on 
behalf of “phenomenal determination of the physical” (in his penul- 
timate paragraph, 701), where he says, among other things, that “‘if 
we make the physical differences fall below the threshold of human 
sense organs, there are always instruments. When all else fails, we 
can always postulate superhumanly sensitive perceivers to take up 
the slack.” - 

To take the latter suggestion first, in physical worlds in which 
sentience is not even physically possible, dispositional predicates in- 
volving “superhuman perceptual capacities” are without effect 
(again, unless a form of spiritualism is being embraced). We cannot 
effectively entertain what superperceptual difference a physical dif- 
ference would make in such worlds, for no worlds are countenanced 


*In my “Realist Principles,” Philosophy of Science, L, 2 (1983): 227-49, I 
explicitly considered such cases, in order to undermine the sort of tu quoque 
objection Miller makes, but in connection with phenomenaliam, rather than 
“moralism.” 

The need for purely ontological (physicalist) princi formulated indepen- 
eres of determination principles, was stressed in mpson and my op. cit. 
(1975). 
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in which the physical phenomena of these hostile worlds occur to- 
gether with any possible superperceivers.® 

With regard to the appeal to scientific instruments, two remarks 
are in order. First, they too are, a fortiori, physically impossible in 
the sorts of worlds (or world stages) under consideration, since, by 
hypothesis only radiant energy is possible—stable atoms cannot 
even be formed. So counterfactuals involving that which instru- 
ments—in the pertinent sense of epistemically accessible instru- 
menis—would measure are without effect. Second, it is outright 
cheating for the phenomenalist to make use of terms referring to 
physical measuring instruments in the first place (quite indepen- 
dently of the first point). Such terms are part of the (macroscopic) 
physical vocabulary, and unless they can be phenomenally recon- 
structed, their use in the present context could at best support 
merely a claim of determination of the physical by the phenomenal 
+ the macrophysical. But of course the situation is worse, for no one 
has ever come close to providing a thoroughgoing phenomenalist 
substitute for such macrophysical terms (one of the salient reasons 
being the vicious circularity of appealing to other instruments or 
material background conditions relevant to the perceptions an actu- 
ally unperceived object ‘“‘would give rise to,” as Wilfrid Sellars, 
J. J. G. Smart, and others forcefully argued decades ago). 

Thus, when properly framed, determination relations are not only 
nontrivial; they point to deep asymmetries among major alternative 
metaphysical frameworks that have inspired philosophical inquiry 
over many centuries. 

This is, of course, not to say that there is a unique (minimal) 
adequate determination base; the prospect of multiple alternative 
such bases (ones not even mutually intertranslatable) has also been 
recognized.’ Nor is it to say that asymmetry always prevails. In fact, 
the article under consideration here does suggest an amusing way of 
so stretching the notion of “morally relevant property” that an ex- 
ample of codetermination may be in the offing. 

To see how this can arise, consider the following sort of scenario: 
Dr. Strangelove (pretend in the actual world) has his finger on the 


* Concerning early-cosmos type examples: if one wishes to consider temporally 
complete cosmological models, then care must be taken to ensure either that 
sentence is ruled out for all time (in both worlds considered), or that the early 
differences remain forever causally and epistemically inaccessible The latter is 
readily arranged, as evolution of the cosmos is described, e.g., by Stephen 
Weinberg in The First Three Minutes (New York: Basic, 1977) 

Such complexities could be bypassed by choosing even simpler examples, e g, 
worlds with a very small number of elementary particles and minuscule total 
energy, so that (given our fixed background theory) all possibility of sentience 
ever arising is ruled out. 

® See, for example, the “codetermination” standard of theoretical equivalence 
in Thompson and my op. cit. (1977). 
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button and resolves to push it just in case (he ascertains that) there is 
a possible world of type A that also has arbitrary physical property 
B, where A is possessed by both worlds of any proferred counterex- 
ample to “moral determination of the physical” and B is the alleg- 
edly morally irrelevant physical property distinguishing one of the 
worlds from the other. (B can be as trivial as you like, e.g., a minor 
density fluctuation, a particular value of momentum of a particle, 
whatever.) Now, B appears to be “morally relevant,” for the world 
that satisfies it also satisfies ‘gives Strangelove an excuse to blow up 
the Earth’! (Strangelove, being an accomplished physicist, may even 
set about trying—singlehandedly—to deduce whether an A & B 
model is a solution to, say, the Einstein equations!) Therefore, al- 
lowing in dispositions such as ‘could give a (possible, if not actual) 
Strangelove an excuse to. . .’, we can sustain even a modalized 
version of “no physical difference without a (Strangelovian) moral 
difference”! What should one say about such a line? 

Of course, on balance the worids of any such example are morally 
on a par (in any ordinary sense of “equally good or bad”), since 
Strangelove (or a Doppelganger) could interchange the roles of B 
and not-B, etc. Still, one might say such alternative scenarios just 
correspond to so many more morally relevant differences that B 
makes, whose total effect is to leave B and not-B “equally good.” 

One thing to say is that a predicate such as ‘gives a (possible) 
Strangelove an excuse to. . .’ at best shows that a given prima facie 
morally irrelevant physical property could become morally relevant, 
not that it in fact “makes a moral difference.” In contrast, the 
dependence of the psychological, say, on the physical is not merely a 
potential dependence. It is not merely that a mental event, say, a 
feeling of revulsion, might be attendant on neural events constitut- 
ing, say, a reaction to perception of mass slaughter; it is that it is 
attendant on such things, and in lawful ways that science may il- 
luminate. 

Another thing to say is (the obvious) that such Strangelovian prop- 
erties are “morally adventitious”: they have nothing to do with in- 
trinsic moral worth. And if it is claimed merely that the morally 
adventitious determines the physical, that may be granted as a 
truism (assuming we are allowed to play the above (Peter) Sellersian 
game). Indeed, one would not attach great metaphysical significance 
to this thesis. We see it as an artifact of the (stipulated) open-ended- 
ness of “morally relevant property.” This, however, hardly impugns 
the metaphysical relevance of supervenience/determination. The 
most that it shows is that, indeed, there are some (metaphysically) 
uninteresting cases. 

GEOFFREY HELLMAN 
University of Minnesota 
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Pursuit of Truth. W. V. QUINE. Cambridge: Harvard University 
Press, 1990. viii, 113 p. Cloth $17.50. 


The aims of this little book are, in W. V. Quine’s own words, to 
“update, sum up, and clarify” his “variously intersecting views on 
cognitive meaning, objective reference, and the grounds of knowl- 
edge” (vii). A marvel of compressed exposition, it should appeal to 
long-time students of Quine, who want to keep abreast of his 
current thinking, as well as to newcomers seeking an introduction to 
the views of the most influential American philosopher of his gen- 
eration. 

Though not drastically revisionary, the book shows a general drift 
toward a softening of some of Quine’s more radical theses. The 
familiar Quinean themes are all in evidence, but frequently played 
in more muted tones. Take Quine’s conception of epistemology: 
while still holding that there is no first philosophy, firmer than 
science and serving to “ground” it, Quine retreats from his some- 
time view that this turns epistemology into a straightforward chapter 
of psychology. He now maintains that, within the “baffling tangle of 
relations between our sensory stimulation and our scientific theory 
of the world, there is a segment that we can gratefully separate out 
and clarify without pursuing neurology, psychology, psycholinguis- 
tics, genetics, or history. It is the part where theory is tested by 
prediction. It is the relation of evidential support, and its essentials 
can be schematized by means of little more than logical analysis’’ 
(1-2). The project of Pursuit of Truth sounds less like the undertak- 
ing announced in “Epistemology Naturalized’’! and more like philo- 
sophy as the positivists understood it. 

Quine’s well-known holism also appears in a subtly moderated 
form. The radical thesis that the unit of empirical significance is the 
whole of science gives way to the modest claim that a testable set of 
sentences needs ‘‘critical semantic mass,’’ which means it has to be 
“pretty big” (17). Holism corrects “the naive conception of scien- 
tific sentences as endowed each with its own separable empirical 
content” (16). But Quine distances himself from those coherence 
theorists, some inspired by Quine himself, who hold that all infer- 
ence is inference to some total view of the world. 


‘In Ontological Relativity and Other Essays (New York: Columbia, 1969), 
pp 69-90. 
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On the issue of meaning, Quine makes no retreats: determinate 
meanings are a myth. For a holist, “[t]rue sentences, observational 
and theoretical, are the alpha and omega of the scientific enterprise. 
They are related by structure, and objects figure as mere nodes of 
the structure” (31). Since “what is empirically significant in an ontol- 
ogy is just its contribution of neutral nodes to the structure of the 
theory,” systematic reinterpretation of a theory's ontology gives us 
two theories empirically indistinguishable as wholes. This thesis of 
ontological relativity, or the inscrutability of reference, concerns 
only mutually compensating interpretations of words within sen- 
tences. But the more dramatic thesis of the indeterminacy of trans- 
lation ‘‘declares for divergences that remain unreconciled even at 
the level of the whole sentence, and are compensated for only by 
divergences in the translations of other whole sentences” (50). Since 
meaning is whatever is captured by translation, if translation is inde- 
terminate, so is meaning. But since all that matters in translation is 
“fluency and effectiveness of dialogue” (59), determinate meanings 
are dispensable, even for linguists and lexicographers. 

Not so for irreducibly intensional discourse. Our practice of attri- 
buting beliefs and other propositional attitudes to our fellow 
speakers “harbors indispensable lore about human activity and mo- 
tivation” (71). And though we do well to keep such talk out of 
science, which can then enjoy “the crystalline purity of extensiona- 
lity” (71), there is no principled barrier to “science on the inten- 
sional side too, insofar as there are observational checkpoints how- 
ever tentative” (72). The main thing is to avoid mixing vocabularies. 
Quine commends Donald Davidson’s “anomalous monism” or “‘to- 
ken physicalism”: we can aspire to a fully extensional global theory 
of the world, but we still need an alternative, intensional way at 
looking at some of the events it covers. We may end up with “multi- 
ple scientific theories, jointly true” (73). (Truth, incidentally, is still 
just disquotation.) So where Quine once wrote of the irreducibility 
of the intensional as showing ‘‘the baselessness of intensional idioms 
and the emptiness of a science of intention,”? his attitude today is 
notably forgiving. 

But while many of Quine’s thoughts became clearer to me, some 
did not. In particular, I find Quine’s conception of a “naturalized” 
theory of knowledge—its task and its relation to traditional episte- 
mology—very murky. Quine has hinted both that naturalized episte- 
mology replaces attempts to refute the skeptic and that it dissolves 


2? Word and Object (Cambridge: MIT, 1960), p. 221. 
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skeptical doubts. But here interest in skepticism is conspicuous by its 
absence, so matters are not clarified. And what he does say is not 
helpful. He repudiates “the Cartesian dream of a foundation for 
scientific method, firmer than scientific method itself.” Neverthe- 
less, he remains “occupied with what has been central to traditional 
epistemology, namely the relation of science to its sensory data” 
(19). But the traditional notion of “sensory data” is precisely that of 
a foundation for empirical knowledge. So I do not see how one can 
reject the dream of a foundation for science, while retaining an 
interest in anything like the traditional problem of connecting our 
knowledge of the world with our sensory data. 

I suspect that Quine has no clear conception of naturalized episte- 
mology and that the source of his confusion is his use of “‘stimula- 
tion,” which is a peripheral matter only physiologically speaking. A 
stimulation, defined as the set of all a given subject’s triggered ex- 
teroceptors (3), is just a neurological event. But he also treats stimu- 
lations as the “data” from which we “project our theory of the 
external world.” He sees his interest in “neural input” as epistemo- 
logical, precisely because it concerns “‘the flow of evidence from the 
triggering of the senses to the pronouncements of science” (41). It 
is not easy to see how evidence can flow from things that are not 
even beliefs. To bear on the traditional problem of justifying our 
beliefs about the external world, stimulations must carry a distinc- 
‘tive content, graspable prior to knowledge of the world. This takes 
us back to the quest for a foundation. 

For Quine, evidence proper comes in with observation sentences, 
which for the most part already concern external objects. Again, 
observation sentences may be prompted by stimulations but they are 
not “projected” from them. In fact, there is not even an explana- 
tory, never mind a justificatory, connection between stimulations, 
classified according to physical criteria, and assent to observation 
sentences. Although acquiring an observational vocabulary may de- 
pend on a phase of conditioning to stimulation, once observation 
sentences become embedded in a set of sentences of critical seman- 
tic mass, the only condition under which they test anything, they 
cease to be under such tight stimulus control, for what stimulations 
prompt assent to a given observation sentence becomes sensitive to 
collateral knowledge. Holism intervenes: mature observation sen- 
tences have no empirical content, hence no stimulations, which they 
can call their own. Indeed, to try to explain beliefs by appeal to 
stimulations would be to mix extensional and intentional talk in the 
very way anomalous monism warns us against. As Quine himself 
notes, epistemologically relevant similarity, with respect to stimula- 
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tions, is perceptual not receptual: similarity of input is determined 
by similarity of response. 

Why the confusion? Because Quine wants to think that, purged of 
its dogmas, empiricism remains science’s philosophy of science. 
Stimulations as data are a faint, lingering echo of the myth of the 
given. Thus Quine tells us that “the most notable norm of natura- 
lized epistemology actually coincides with that of traditional episte- 
mology”: nihil in mente quod non prius in sensu (19). But by 
Quine’s own showing, nothing epistemologically relevant is “in the 
senses,” as Quine understands the senses. So this “watchword of 
empiricism” no longer carries its traditional sense, reducing to a 
banal warning against ‘‘telepaths and soothsayers.” Only Quine’s 
tendency to treat stimulations sometimes as mere neurological 
events and sometimes as constituting a kind of primitive, experien- 
tial knowledge, allows him to see a close connection between episte- 
mology as he understands it and its traditional empiricist forbear. 
Resolve the tension and the connection disappears. Much of 
Quine’s naturalized epistemology disappears with it, however. 

Quine’s holism also works against the idea that there is much to be 
said about scientific method. Logical analysis shows, as Karl Popper 
has insisted, that ‘“‘[p]ure observation lends only negative evidence, 
by refuting an observation categorical that a proposed theory im- 
plies” (13). But holism ensures that there is never just one way of 
accommodating a recalcitrant observation. There are vague maxims, 
such as that of minimum mutilation. But there are no rules. Still less 
is there a science of how we project theories from data. 

This is a short book. But I fear that for a philosopher who re- 
nounces empiricism’s dogmas and shows no sustained interest in the 
roots of skepticism, epistemology is a short subject. 

MICHAEL WILLIAMS 
Northwestern University 


Russell, Idealism, and the Emergence of Analytic Philosophy. 
PETER HYLTON. New York: Oxford University Press, 1990. xv, 420 p. 
Cloth $69.00. 


At my university (Cambridge) the content of courses in the history 
of philosophy stops, conveniently, around the year 1900. Philo- 
sophers whose work comes after that date are dealt with in courses 
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on “Metaphysics,” “Logic,” etc. As a result, our students read Ber- 
trand Russell in the same context as that in which they read Donald 
Davidson; they engage with Russell as if he were an emeritus profes- 
sor, only recently retired. This tradition owes much to Russell him- 
self. It seemed to him that, as the new century began, he was escap- 
ing from the old cave of idealist appearances into the clear light of 
logic, mathematics, and, thanks to G. E. Moore, goodness. So al- 
though Russell thought that the works of previous philosophers 
were worth studying, the tradition that derives from his work—the 
analytic tradition in philosophy—cannot readily accept that by now 
his works just belong to the history of philosophy: for that might 
suggest both that his break with the past was not as sharp as it 
seemed, and that our current debates do not just continue his con- 
cerns. It might even suggest that, if he is the paradigm analytic 
philosopher, we are not. 

Peter Hylton, himself a Cambridge graduate, aims precisely to 
insert Russell into the history of philosophy by acquainting us with 
his earliest works and refusing to engage in debates with him (except 
in the odd footnote). Thus the style of his book is altogether differ- 
ent from that of, say, Mark Sainsbury’s Russell.’ Whereas Sainsbury 
pursues themes from Russell down to contemporary debates in 
order to evaluate Russell’s ideas from a current point of view, Hyl- 
ton aims to set up a historical distance between Russell and our- 
selves so that we can appreciate the differences between Russell’s 
views and our own. Admittedly, Hylton also wants to make contem- 
porary analytic philosophers aware of those features of their meth- 
ods which are historically explicable in the light of Russell’s work. 
But his proposals in this area are rather sparse and questionable— 
e.g., that analytic philosophy has been decisively influenced by the 
thesis that ‘‘mathematical logic provides the crucial method of 
philosophy.” 

In his opening chapters, Hylton sets out to estrange us from the 
all-too-familiar author of “On Denoting” by introducing us to the 
unfamiliar territory of British idealism, as represented by T. H. 
Green, F. H. Bradley, and J. M. E. McTaggart. His aim is to get 
behind the superficial understanding of these philosophers which 
Russell’s criticisms of them manifest, so that we can appreciate both 
Russell’s own early writings, especially The Foundations of Geome- 
try, and the complexity of the issues at stake in the subsequent 
debates between Russell and Bradley. Hylton indeed urges that 


1 New York: Routledge, 1979. 


BOOK REVIEWS 53 


many of Russell’s criticisms of Bradley fail to engage with Bradley's 
thought because Russell already makes assumptions that Bradley 
would reject. Thus their debates about the reality of relations are 
largely pointless because Russell does not do justice to Bradley's 
conception of reality. By and large I agree with Hylton on this mat- 
ter, though I do not think that even he succeeds in making the 
intellectual grounds for Bradley’s ascent to the Absolute comprehen- 
sible. But there is one point which Hylton stresses which seems to 
me to need qualification. This concerns the issue of degrees of 
truth: Hylton takes this to be one of the fundamental disagreements 
between Russell and Bradley which is never properly thematized in 
their disputes. That there is a disagreement here is, of course, indis- 
putable; but I think one’s attitude to it is modified when one realizes 
that Russell’s philosophy incorporates a variant of Bradley’s rela- 
tive/absolute distinction in its different attitudes to common-sense 
analysanda and to their philosophical analyses. Even a Moorean 
insistence on the certain truth of our common-sense judgments can- 
not disguise the fact that, once it is allowed that these judgments 
stand in need of analysis, their truth becomes relative to that of their 
analysis. 

In describing Russell’s rejection of idealism, Hylton rightly 
stresses the decisive role of the young Moore. Hylton characterizes 
Moore’s early philosophy as “Platonic Atomism.” The atomism is 
certainly there, and is central to Moore’s rejection of Bradley’s to- 
talitarian holism. I am less happy about the Platonist comparison; 
for Moore did not think that there is a real world of forms (or 
concepts) behind ordinary appearances. Instead, he thought that 
reality extends beyond that which actually exists—the possible is as 
real as, and of the same ontological type as, the actual (so the appro- 
priate comparison is with David Lewis). Hylton’s misjudgment here 
leads him to overlook the basis of Moore’s views about necessity, 
about which he puzzles a good deal. In fact, Moore just has a Leib- 
nizian view, which he expresses in terms of the universality of neces- 
sary truths. 

Hylton devotes his longest chapter to The Principles of Mathe- 
matics. He shows first how Russell combined Moore’s philosophy 
with the new logical techniques he had learnt from Peano to 
put together a powerful anti-idealist polemic on the topic of infinity 
and thus “to play the part of Samson” in relation to Kant and his 
disciples. And on this matter Hylton does seem content to allow the 
validity of Russell’s criticisms of idealism. But, Hylton argues, this 
judgment should be qualified by a recognition of the fact that Rus- 
sell lacked a workable account of his own logical theory. The central 
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issue here is that of “the variable,” which Russell discusses in con- 
junction with his theory of denoting concepts and propositional 
functions. Hylton suggests that Russell had hoped to handle vari- 
ables by simply treating the expression ‘any term’ as expressing a 
denoting concept, but found that his theory of denoting concepts 
was not adequate to the complexities of multiple quantification: as a 
result, Russell was compelled to treat the conceptions of a proposi- 
tional function and of a variable object as fundamental, although he 
was not at all content with this conclusion. This seems to me largely 
correct: and Russell’s discussion of these issues does now seem strik- 
ingly archaic, though I suspect that the techniques developed by 
W. V. Quine in “Variables Explained Away” (in his Selected Logical 
Papers?) could be exploited to give Russell’s theory of denoting 
concepts a better chance of success. 

The stage has now been set for “On Denoting.” Hylton shows 
persuasively that the old theory of denoting concepts could handle 
the puzzles concerning empty names and referential opacity with 
which Russell introduces the new theory. So the motive for the new 
theory must come from some other dissatisfaction with the old 
theory. In part, it just must have been the perceived need anyway 
for a separate treatment of variables; but Hylton also provides a 
plausible reconstruction of the notorious “The First Line of Gray’s 
Elegy” argument which both delivers the requisite conclusion, that 
the relation between a denoting concept and its denotation seems 
inherently inexpressible, and avoids the imputation of gross use/ 
mention confusions. Although Hylton appears to think that this ar- 
gument is satisfactory from Russell’s point of view in 1905, I did not 
find it convincing, since the reconstructed argument just assumes 
that denoting concepts cannot ever occur as nondenoting terms 
(Hylton even cites a passage from an unpublished manuscript in 
which Russell appears to question this assumption). My own view (to 
which Hylton also alludes briefly) is that Russell was always under 
pressure to abandon the theory of denoting concepts because it was 
inconsistent with a fundamental premise of his “Platonic Atomism”’ 
—that true propositions do not represent, or denote, the facts— 
they are themselves the facts. Therefore, once Russell had worked 
out how to handle the problematic case of definite descriptions 
without the old theory, he was keen to abandon it altogether— 
though Hylton intriguingly suggests that the relation between a 
propositional function and its values is not so very different from 
that between a denoting concept and its denotation. 


* New York: Random, 1966. 
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Russell’s theory of descriptions leads into his conception of in- 
complete symbols and the theory of logical fictions. But this line of 
thought, central to Russell’s analytic program, intersects with an- 
other that also has its origins in The Principles of Mathematics— 
the theory of types. Russell introduces this to cope with his paradox; 
in the context of The Principles of Mathematics, however, as Hyl- 
ton shows, the theory conflicts with Russell’s assumptions that any- 
thing can be a term, and that all terms have the same status. Since 
these assumptions are central to “Platonic Atomism,” Russell’s sub- 
sequent development of the theory of types marks a significant re- 
treat from that philosophy, and in the last part of his book Hyiton 
discusses similar points, such as the replacement of propositions by 
the multiple-relation theory of judgment and the abandonment of 
the naive direct realism characteristic of the epistemology of his 
early thought. Hence, as Hylton progressively reintroduces us to 
that old friend, the author of The Problems of Philosophy, we find 
we have to bid farewell to the stranger whose acquaintance we have 
just been making—the author of The Principles of Mathematics. 

There is much in Hylton’s long book that I have not discussed 
(e.g., his treatment of Principia Mathematica). I have concentrated 
on his central theme, the rise and fall of “Platonic Atomism,” and in 
his discussion of this, Hylton, I think, succeeds brilliantly. As I have 
indicated, I have some reservations about details, but the overall 
conception and achievement is extraordinary. Hylton’s book pro- 
vides an exemplary combination of scholarship, lucidity, and imagi- 
nation. He sets a new standard for all subsequent discussions of 
Russell’s philosophy. 

THOMAS BALDWIN 
Cambridge University/Clare College 
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SOUR GRAPES AND CHARACTER PLANNING* 
“Fit yourself into accord with the things in which your position has 
been cast, and love the men among whom your lot has fallen, but 
love them truely.” 
Marcus Aurelius 


Lebensregel that one should mold one’s desires such that 

they come to match whatever it is that life has in store. Aside 
from being distrustful, I remember being puzzled by this ancient 
advice. What is puzzling about it is that, though the advice is well- 
taken in certain cases, it is strictly off the mark in others. Imagine a 
person who switched her preference for pepsi or coke back and 
forth depending on what the nearest vending machine had to offer. 
Similarly, Nietzsche’s Zarathustra “laughed at the weaklings who 
thought themselves good because they had no claws.” Yet, at the 
same time, there is something very respectable about the capacity to 
exercise control over one’s own emotions and desires. It was argued 
by Harry Frankfurt? that freedom of the will precisely consists in 
being able to control what desires will issue in action. And what 


N S a freshman in college I was first exposed to the Stoic 


* This paper is a shortened version of the first beaded E trea: 
Reasons for Preferences, Univeraty of Minnest/Minveapols T also thank the 
National Fund for Scientific Research and the National Science Foun- 
dation (U.S.A.) for their financial support. Í am grateful to the numerous individ- 
uals who have made contributions to this paper. C. A. Anderson, George Bealer, 
Willem deVries, Kevin Falvey, Ronald Giere, Jean Ladri¢re, Andrew Mason, H.E. 
Mason, Michael Otsuka, Michael Root, Naomi Scheman, and Yutaka Yamamoto. 

1 Thus Spoke Zarathustra, in The Portable Nietzsche, W. Kaufmann, ed. 
(New York: Viking, 1954), p. 230. 

“Freedom of the Will and the Concept of a Person,” this JOURNAL, LXVII, 1 
(January 14, 1971): 5-20. 
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could be wrong with choosing one’s desires so as to avoid disap- 
pointment and frustration? 

This tension in the Stoic advice is captured very crisply in Jon 
Elster’s® discussion of sour grapes and character planning. These 
phenomena both involve a shift of desires in response to what one 
takes to be feasible or morally correct. For less respectable cases of 
this kind, the term sour grapes (SG) comes to mind. For more re- 
spectable cases, Elster reserves the term character planning (CP). 
He goes on to ask what distinguishes SG from CP. It is this question 
which articulates my earliest apprehension over the Stoic advice. 
Elster appeals to the autonomous nature of CP that is lacking in SG. 
I shall question this stand and defend an alternative solution that 


runs parallel to Donald enon account of akrasia. 


Elster extensively discusses the question what preferences must be 
like in order for them to be rational. Preferences acquired through 
SG are a paradigm case of irrational preferences. The fox in La 
Fontaine’s fable has an appetite for grapes, though when he finds 
out that he cannot reach for them, he claims that grapes are too 
sour for his canine taste anyway. (On this reading of the fable, the 
fox does not change his beliefs about the sourness of the particular 
grapes in question, but rather changes his tastes concerning the 
sourness of grapes in general.*) Elster contrasts preference acquisi- 
tion through SG with preference acquisition through CP. It is a 
commonplace in Aristotelian ethics that an action cannot qualify as 
a virtuous action unless the agent performs it with a sense of joy. 
This sense of joy is acquired through habituation. Imagine a poker 
player who enjoys the game tremendously because of the opportuni- 
ties for cheating involved. She comes to realize that cheating is mor- 
ally reprehensible and decides to start a better life on this score. 
Initially she does not find fair-play poker terribly exciting. Nonethe- 
less, in her quest for Aristotelian virtue, she is committed to becom- 
ing the kind of person who enjoys fair play. And in order to carry 
through this project of CP, she chooses the Aristotelian route of 
habituation. That is, she decides not to give in to her initial misgiv- 
ings and to stick to fair play, hoping that some day she will thus 
come to enjoy this style of playing. 

This Aristotelian scheme can be extended beyond the moral 


* Sour Grapes (New York Cambridge, 1983), See 117-9; fe pin 
Technical Change (New York: Ganhei, 1983), pp. 84-8; Rational Choice, 
Elster, ed. (New York: Blackwell, 1986), pp. 14—5. 

*C£. Sour Grapes, p. 123n 
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realm. The poker player’s choices were constrained by moral consid- 
erations. But choices may also be constrained simply because alter- 
natives are no longer feasible. Imagine that the management of the 
casmo decided to install proper lighting, such that the opportunities 
for cheating are entirely blocked. Our poker player is not at all 
pleased with this new arrangement. Nonetheless, she decides to stick 
with it and to try to develop a liking for fair play. It may well turn 
out that, after a few games, she will indeed come to enjoy fair-play 
poker quite a bit better than her former style. 

What is puzzling about SG and CP is that both phenomena can be 
described m the same way, namely, both the fox and the poker 
player adjust their preferences in reference to what they believe to 
be feasible alternatives. Nonetheless, while the fox is an epitome of 
irrational preference acquisition, the same judgment could hardly 
be passed on to the poker player. Nothing stands in the way to say 
that her preferences took shape in an entirely rational fashion. So 
what is it about SG and CP that makes us pass such radically differ- 
ent judgments on these phenomena? The phenomena fit the same 
description, so what is left out of this description which can account 
for this distinction? Before I turn to Elster’s solution to this puzzle, I 
shall shortcut a few incorrect, though nonetheless very telling sug- 
gestions. 

(a) It may be suggested that our puzzle is ill-phrased in that it 
assumes there actually are two phenomena in play. One might argue 
that ‘SG’ and ‘CP’ describe one and the same phenomenon, which is 
irrational in some respects and rational in other respects. If one 
wants to highlight the irrational character of a particular preference 
adjustment in reference to the feasible alternatives, then ‘SG’ is the 
appropriate term. If one wants to highlight the rational character of 
the very same case, then ‘CP’ is the term that comes to mind. I admit 
that, for a great many cases of preference adjustment in reference 
to the feasible alternatives, it is not clear whether to categorize them 
as instances of SG or as instances of CP. Arguments can be made on 
either side of the fence and no empirical information is lacking 
which could decide the debate. One cannot deny, however, that 
there are also clear-cut cases around. It is simply incorrect to de- 
scribe the fox’s preference adjustment as an instance of SG. The 
existence of such clear-cut cases is sufficient to safeguard our 
puzzle. This is nonetheless a very telling suggestion. A correct ac- 
count of SG and CP should also explain why there is such a broad 
“gray” area between both phenomena, i.e., why it is that so many 
cases seem to have something of SG as wef as something of CP 
about them. 
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(b) It may be suggested that, whereas the poker player takes her 
time to come to appreciate fair play, the fox renounces his appetite 
for grapes at the spur of the moment. But how could this difference 
in timing account for our judgments concerning the irrational char- 
acter of SG and the rational character of CP? Furthermore, we can 
well imagine a fox who passes through a phase of denial, anger, or 
resignation before he finally makes the SG move or a CP poker 
player who is very adaptable and in no time develops an apprecia- 
tion for fair play. Nonetheless, this suggestion is very telling in that 
CP typically involves more of a project than SG. If a preference for 
an alternative that is no longer feasible is abandoned at the spur of 
the moment, there is a presumption of irrationality. This is not the 
case if such involves a gradual process. A correct account of SG and 
CP should also provide an insight into what underlies this pre- 
sumption. 

(c) It may be suggested that, whereas the poker player actually did 
change her preference, the fox merely engages in an act of self-de- 
ception. The phenomenon of SG does not consist in a preference 
adjustment, but rather in constructing a false belief about a stable 
preference for a nonfeasible alternative. I admit that there are genu- 
ine cases of self-deception in which a person forms a false belief 
about her preference for a nonfeasible alternative. In such cases, 
one will come to abandon this false belief upon sincere introspec- 
tion or would indeed choose for the alternative if it were suddenly 
made available. Yet I want to claim that there are cases in which a 
person actually does change her preference because some alterna- 
tive is no longer feasible and which do not qualify as instances of CP. 
Suppose that, upon sincere introspection, one would still hold on to 
one’s adjusted preference or, if the nonfeasible alternative suddenly 
became feasible once again, one would turn away from it. I want 
‘SG’ to refer to such cases of irrational preference acquisition, not 
to cases in which a false belief is being formed about a stable prefer- 
ence. Upon sincere introspection, the fox would still claim to have 
lost his appetite for the grapes and, if we were to lower the vine for 
him, he would turn away his head. Nonetheless, there is something 
very telling about this suggestion, in that, for each case of SG, there 
is a persistent temptation to give a description in terms of self-de- 
ception about a fixed preference. A correct account of SG should be 
able to explain this temptation. 


n 
In trying to solve the puzzle, Elster argues that preferences are 
rational only if they are rationally acquired and the criterion for 
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rational preference acquisition is autonomy. This criterion is meant 
to distinguish SG from CP. Unlike the fox’s whims, the poker 
player’s preference reversal in favor of fair play is autonomous in 
character. What is distinctive about autonomous preference acquisi- 
tion is that it is open to intentional explanation." Elster distinguishes 
between three types of scientific explanation, namely, causal, func- 
tional, and intentional. He appeals to Davidson’s theory of action to 
fill in his account of intentional explanation. 

Some of our doings, i.e., some of the things we do, are actions. 
Actions are doings that are intentional under some description. 
Other doings, like tripping over the carpet, are not intentional 
under any description. These are not actions, they are mere doings. 
Davidson argues that 


M) A doing (and hence an action) is intentional under some descrip- 
tion d, if and only if 
(a) there is some subset of the agent's mental states which provides 
for a reason for the doing under d, 
(b) these mental states were causally efficacious in bringing about 
the doing, 
(c) and this causal-efficacity is of the correct type. 
The set of mental states, which provides for a reason for an action 
under some description, contains a cognitive component (a belief) 
and an evaluative component or a pro attitude. For instance, a rea- 
son for the action under the description ‘adding sage to the stew’ is 
the agent’s pro attitude or desire to improve the taste of the stew 
together with the belief that adding sage to the stew will improve 
its taste. 

Let us return to Elster’s theory. A doing is open to intentional 
explanation if and only if there exists a description under which the 
doing is intentional. Preference acquisition is a doing for both the 
fox and the poker player. Here is how the distinction comes in 
between SG and CP. The poker player’s preference acquisition is 
open to intentional explanation, i.e., there exists a description of 
her preference acquisition under which it is intentional. She inten- 
tionally adjusts her preferences in reference to the feasible alterna- 
tives. The fox’s preference acquisition is closed to intentional expla- 
nation. He does not intentionally adjust his preferences in reference 


, Explaining Technical Cha PP- 
* Actions and Events (New Y. ond I 1980), pp. 4-12, 78-9, 86-7. Notice 
that Davidson himself does not use the expressions ‘a doing’ and ‘a mere doing’. 
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to the feasible alternatives, nor does there exist any other descrip- 
tion under which his doing is intentional. Consequently, the poker 
player’s preference acquisition is an action, while the fox’s prefer- 
ence acquisition is a mere doing. We can thus generalize: preference 
acquisition through CP is an action, while preference acquisition 
through SG is a mere doing. 

What accounts for this distinction? Elster argues that the former 
phenomenon is caused by a metadesire to adjust one’s preferences, 
while the latter is caused by an “affective drive” that aims at reduc- 
ing the frustration from unsatisfiable preferences. Metadesires, un- 
like drives, are pro attitudes and thus may qualify as an evaluative 
component in the set of mental states which provides a reason for an 
action under some description. Drives, on the other hand, can be no 
more than causes of mere doings. But what is so different about 
drives and desires which could motivate this distinction? 

Elster believes that drives differ from desires in that they “are not 
conscious and known to the person who has them.’ I have first- 
person knowledge of my desires as the reasons for my actions. I do 
not have first-person knowledge of my drives as the causes of my 
doings. Secondly, unlike drives, desires “may forgo short-term plea- 
sure to achieve some longer-term gain.’’”® Prudence can be an attrib- 
ute only of desires, not of drives. And thirdly, Elster more than once 
takes recourse to mechanistic metaphors to refer to drives. Accord- 
ing to Davidson, common mental events are in principle open to 
both a full physical description and a full mental description (op. 
cit., pp. 245-9). Elster believes that drives are open only to a full 
physical description. Neuroscience does not yet allow for such de- 
scription, however, and hence for the individuation of drives we 
must take recourse to their behavioral effects. If we want to describe 
the causal agents of these behavioral effects as such we can only 
invoke metaphors, like “forces” or “the wirings of the pleasure 
machine,’’!° 

Elster admits that his characterization of the set of autonomous 
preferences is merely tentative. First, he wants to exclude prefer- 


7 Sour Grapes, p. 24. 

* Ibid. Cf. ibid., p. 21; Rational Chosce, p. 15; Explaining Technical 
Change, p. 87 

° Sour Grapes, p. 25. 

1° Sour Grapes, p. 25; Explaining Technical Change, pp. 71-2, with refer- 
ence to A. Tversky, “‘Self-deception and : Some Psychological Ob- 
servations,” paper presented to a colloquium on ‘ Multiple Self” (Paris: Mai- 
son des Sciences de l'Homme, 1982). 


SOUR GRAPES AND CHARACTER PLANNING 63 


ences shaped by metadesires that are themselves nonautonomous in 
character. Second, he acknowledges that some preferences are fully 
autonomous, though they were not intentionally acquired. He men- 
tions socialization within a particular culture and learning to appre- 
ciate alternatives by simply trying them out as nonintentional pro- 
cesses of preference acquisition. One could hardly deny that at least 
some preferences so acquired are autonomous in character.?! 


im 

The core of Elster’s theory is that preferences are rational only if 
they are rationally acquired and that preferences are rationally ac- 
quired if and only if they are auttonomousty—that is, intentionally— 
acquired. Smce in CP preferences are intentionally acquired and in 
SG they are not, the former preferences are rational, the latter are 
not. I shall first argue that Elster’s understanding of intentional 
preference acquisition is not in line with his professed commitment 
to Davidson’s analysis of intentional action, nor with his own com- 
mitment to rationalchoice theory. Subsequently, I shall challenge 
intentional preference acquisition on a strict Davidsonean reading 
as a criterion for rational preferences. 

Consider the following application of Davidson's analysis (I) to 
intentional preference acquisition: 

(i) A person intentionally acquires a preference if and only if some 

subset of her mental states provides for a reason for her acquiring the 

preference and is causally efficacious to this effect in the correct way. 
Elster argues that preference acquisition through drives is not in- 
tentional, since drives are not conscious and known to the person 
who has them. This suggests a commitment to the claim that a per- 
son intentionally acquires a preference only if she has first-person 
knowledge of her reasons for doing so. Yet this claim is not in line 
with Davidson’s analysis of intentional action. Rather, it suggests the 
following requirement: 

(i) An action is intentional under some description d only if the agent 

has first-person knowledge of her reasons for doing d. 

Davidson argues convincingly against the inclusion of this condition 
in his analysis of intentional action.'* Secondly, claim (i) is incompati- 
ble with two other claims to which I believe Elster is committed: 


"| Sour Grapes, pp. 22, 112-4, 138-9; Explaining Technical Change, pp. 
84-7 


"? Davidson argues that a person may intentionally poison Charles, though 
nonetheless be mistaken about her reasons for doing so. For instance, she may 
believe that her reason for doing so is to save Charies pain, while her actual 
reason is to have him out of the way (op. ctt., pp. 17-8). 
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(ii) Rational-choice theory can tell us more about why some person 
performed an action than a genuinely honest actor could (I take 
‘rational-choice theory’ to stand for the body of theories that ex- 
plain actions by means of decision-theoretic or game-theoretic 
models); 

(ii) Explanations in rational-choice theory are paradigmatic cases of 
intentional explanation. 

Elster is clearly committed to (iii).!* Consider claim (ii). Many suc- 
cessful explanations by means of decision-theoretic or game-theo- 
retic models unveil preferences of which the agents do not care to 
have first-person knowledge. The agents tell more flowery stories 
about the springs of their action and, what is more important, they 
are genuinely honest when they tell these stories. Considering the 
actual practice of rational-choice theory, I believe that Elster will 
acknowledge that this theory brings out springs of actions of which 
the agent does not have first-person knowledge. 

Now consider these three claims. If rational-choice theory pro- 
vides for a form of intentional explanation (on (iii)), it lays out the 
springs of actions that are intentional under some description. On 
(ii), these springs are at least sometimes beyond one’s first-person 
knowledge. On (i), they must be within one’s first-person knowl- 
edge. This is how the inconsistency comes in. Since Elster is commit- 
ted to (ii) and (iii), he cannot hold that first-person knowledge of the 
reasons for one’s actions is a necessary condition for intentional 
action. Hence, he cannot include first-person knowledge of the rea- 
sons for a preference acquisition as a necessary condition for inten- 
tional preference acquisition. 

Let us now respect Elster’s professed commitment to Davidson’s 
analysis of intentional action and consider whether analysis (I**) 
could be a plausible criterion in Elster’s theory. $ 

First there is a challenge of principle. Preference acquisition is a 
type of doing. We ascribe rationality or irrationality not only to 
preference acquisitions, but also to doings in general. There are 
standard patterns in our practice of making such ascriptions to do- 
ings in general, e.g., if a doing is an action and is caused by an 
irrational belief (say, a belief acquired through wishful thinking), 
then we call the doing itself irrational. Since preference acquisitions 
are types of doings, a criterion for the ascription of (ir)rationality to 


There is Elster’s explicit avowal that rational-choice ons are a variety 
of imtentional ons (Rational Choice, p. 12). Furthermore, rational- 
choke theory is focal point of Elster’s discussion of intentional explanation 
(Explaining Technical Change, pp. 74-83). 
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preference acquisitions may be more specific than a criterion for the 
ascription of (ir)rationality to doings in general. The former more 
specific criterion cannot violate the latter more general criterion, 
however. A criterion for the ascription of (ir)rationality to prefer- 
ence acquisitions is constrained by standard patterns in our com 
mon practice of ascribing (r)rationality to doings in general. 

Let ‘a description ? stand for a description of an action under 
which the action is intentional. Consider the following claims con- 
cerning the ascription of (ir)rationality to doings in general. 

(i) Consider the set of all descriptions i. The descriptions in this set all 

denote actions that are intentional under i. Within this set, there is 
both a subset of is which denote actions that are rational under i 
and a subset of is which denote actions that are irrational under i. 

Gi) Consider the set of all descriptions of doings. This set contains: 

(S,) the subset of descriptions denoting actions that are intentional 
under this description; 

(Sq) the subset of descriptions denoting actions that are not inten- 
tional under this description; 

(Ss) and the subset of descriptions denoting mere doings. 

Outside (S,) there simply is no description of a doing that is either 
rational or irrational under that description. 

Elster clearly underwrites claim (i). Certainly an action may be 
rational under some description i. Furthermore, much of Elster’s 
work is devoted to developing a taxonomy for actions that are irra- 
tional under some description i. For example, if the set of mental 
states which provides for a reason for an action under some descrip- 
tion i contains a belief that is irrational because it is an instance of 
wishful thinking, then the action itself is irrational under this de- 
scription.'* 

Let us turn to claim (i). First, consider descriptions of mere do- 
ings, i.e., doings that are not actions, like tripping over the carpet. 
Certainly it is meaningless to say of such doings under any descrip- 
tion that they are either rational or irrational. Secondly, consider 
descriptions of actions that are not themselves descriptions i. In 
1978, Albino Luciani (John-Paul I) died less than 5 weeks after his 
election as pope of the Roman Catholic Church. The sentence “The 
conclave elected Albino Luciani’ and ‘The conclave elected a pope 
who had less than 5 weeks to live’ describe the same action, though 
only the former sentence is a description i. It may well have been 
rational or irrational for the conclave to elect Albino Luciani, 


4 Sour Grapes, pp. 26, 123-4; Rational Choice, pp- 13-4. 
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though it would be meaningless to ask whether it was rational or 
irrational for the conclave to elect a pope who had less than five 
weeks to live. 

On Elster’s criterion, preferences are acquired rationally if and 
only if they are acquired intentionally. This criterion violates the 
constraints on the ascription of (ir)rationality in claim (i) and (ii). On ` 
(i), if preferences are acquired intentionally, then such action may 
be rational or irrational under this description. On claim (ii), if pref- 
erences are acquired nonintentionally, then it is meaningless to say 
that such doing—regardless of whether it is an action or a mere 
doing—is rational or irrational under this description. 

Two other challenges are pragmatical in character. Elster spells 
Out a necessary condition on rational preferences on the basis of 
what he takes to be characteristic differences between SG and CP. 
Two questions can be asked. First, does this necessary condition— 
namely, intentional acquisition—indeed point to a characteristic 
difference between preferences acquired through SG and prefer- 
ences acquired through CP? Secondly, we also hold Judgments con- 
cerning the rationality of preferences outside the context of SG or 
CP. Does intentional acquisition indeed express a necessary condi- 
tion on rational preferences in general? 

Considering the former challenge, I fully agree that preference 
acquisition through CP is intentional in character. But why would 
preference acquisition through SG be any less so? Elster argues that 
the latter phenomenon is caused by a drive and that a drive is not a 
mental state and hence cannot qualify as a pro attitude in the reason 
for an action under some intentional description. Yet why is it that 
drives do not qualify as mental states? Elster mentions the absence 
of first-person knowledge for drives. On my earlier argument, this is 
a moot point, of course, since it may well be the case that a person 
does not have first-person knowledge of the mental states that pro- 
vide for a reason for an action under some intentional description. 
Elster also mentions the imprudent character and the orientation 
toward short-term pleasure of drives.!° Yet mental states may well 
share these features. My passion to cruise the Caribbean is not any 
less of a mental state, furnishing the pro attitude in the reason for 
my intentional purchase of a ticket to Barbados, when I do not have 
the necessary funds as when I do have the necessary funds. Hence, I 


"* Notice that this characterization is inconsistent with Elster’s inclusion of the 
phenomenon of “forbidden fruit is sweet” as a type of preference acquisition 
through drives (Sour Grapes, pp. 111-2). Clearty such counteradaptive prefer- 
ences are not geared toward (short-term) pleasure. 
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cannot see any good reason why preference acquisition through SG 
is any less intentional in character than preference acquisition 
through CP. 

Let us now turn to the latter pragmatical challenge. Must rational 
preferences in general be intentionally acquired? Elster admits that 
there are counterexamples to his theory. There exist preferences 
which are indeed nonintentionally acquired, but which are autono- 
mous and hence fully rational. He mentions preference acquisition 
through socialization or through learning. I take these counterex- 
amples to be more than an indication that the theory is in need of 
some fine tuning. 

I have restricted CP to the project of adjusting one’s preferences 
to the feasible alternatives through the medium of habituation. No- 
tice, however, that there are many other media for intentional pref- 
erence acquisition. One such medium is precisely learning. Here is 
an example. I do not like dry wine, though I want to develop an 
appreciation for it, since all my wine connoisseurs friends serve only 
dry wines at their dinner parties. In order to do so, I may decide to 
enroll in a wine-tasting course, rather than fill up my cellar with dry 
wines. That is, I may choose to mold my preferences through the 
route of learning, rather than through the route of habituation. 
There is nothing irrational about preferences that are intentionally 
acquired through the medium of learning. But neither is there any- 
thing irrational about preferences that are nonintentionally ac- 
quired through learning. I may well come to like dry wines in a 
wine-tasting course, without ever having planned such preference 
change. This suggests that intentional acquisition may have very lit- 
tle to do with the ascription of rationality to preferences. 

The same kind of argument can be made for other media of pref- 
erence acquisition as well. CP is intentional preference acquisition 
through habituation. But, similarly, there is nothing irrational about 
nonintentional preference acquisition through habituation. After 
some years in North America, peanut butter has slightly moved up 
from the bottom of the list in my evaluation of alternative sandwich 
spreads. I have never planned such a thing, yet there is nothing 
irrational about this nonintentional preference change through ha- 
bituation. In conclusion, intentional preference acquisition does not 
seem a promising candidate as a necessary condition on rational 


preferences in general. 
Iv 


In this section, J shall venture a new sohition to our puzzle. I share 
Elster’s intuition that the phenomenon of SG yields irrational prefer- 
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ences while the phenomenon of CP typically’® yields rational prefer- 
ences. The challenge thus consists in constructing a necessary condi- 
tion on rational preferences, such that CP can pass the test, while 
SG does not. 

Preferences are defined over alternatives, but they are not the 
only kind of evaluative judgment defined over alternatives. Consider 
the following instructive quote from Davidson: 


. - . particular choices can be explained in decision theory along lines 
very similar to those followed in reason explanations: a particular ac- 
tion is chosen from among the available set because of the agent's 
beliefs (for example how apt he thinks the action is to produce various 
results) and the relative values he sets on the possible outcomes. His 
desires are thus made comparative and quantitative. Some ordinary 
desires, however, do not translate directly into preferences, so that not 
all reason explanations have a clear decision theory counterpart expla- 
nation. This is particularly evident if we think of conflicting desires. A 
person may have a reason for preferring A to B and another reason for 
preferring B to A. This is an embarassment for reason explanations, 
for they need to predict which reason will win out Decision theory 
skips the problem, for it says nothing about why one basic outcome is 
preferred to another, and the theory bars possible evidence of conflict 
in behaviour (op. cit., p. 269). 


Ordinary expressions of wants, i.e., of evaluative judgments defined 
over alternatives, are ambiguous. If a person claims to want some 
alternative at some point in time, it is unclear whether she is ac- 
knowledging that she has reasons that favor the alternative—rea- 
sons that may or may not be trumped by competing reasons at that 
point in time—or whether she is indicating that the alternative ranks 
highest in her all-out ranking at that point in time. For instance, 
suppose I am on a strict diet and utter the words ‘I really want a 
chocolate malt’. This may be taken to mean that I am strongly at- 
tracted to the sweet flavor of chocolate malt but may nevertheless 
stick to my commitment to dieting. Or it may mean that I have 
decided to slip on my diet and that chocolate malt has moved to the 
top in my all-out ranking over alternative desserts. Instances of the 
latter type of evaluative judgments are named “preferences” in de- 
cision theory and are represented formally by utility functions. Let 
us name instances of the former type of evaluative judgments de- 


18 Í introduce this ification because we want to allow that some cases of CP 
derive their irrationality from familiar patterns of irrationality, e.g., when a char- 
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sires. Hence, desires allow for evaluative conflict, while preferences 
do not. In other words, I may desire alternative A over alternative B 
(for some reason) and I may desire B over A (for some other rea- 
son), but I cannot have a preference for A over B and have a prefer- 
ence for B over A.'” So what is the status of preferences within the 
enclosing set of evaluative judgments? 

In “How is Weakness of the Will Possible?” (op. cit., pp. 37-42; 
also 97-102) Davidson sketches an analysis of practical reasoning. 
In this analysis he constructs a taxonomy of evaluative judgments 
defined over actions. This taxonomy will be helpful in trying to 
understand the place of preferences within the set of evaluative 
judgments defined over alternatives. 

Evaluative judgments defined over action tokens are either rela- 
tional or nonrelational judgments. Relational judgments defined 
over action tokens are relative to reasons. Let a and b be two action 
tokens. For some reasons, I may judge that it is better to do a than b, 
while, for some other reasons, I may judge that it is better to do b 
than a. On a logical analysis, the main connective in a relational 
judgment is the two-place sentential operator ‘. . . is a ground to 
judge that ', in which the first place is filled in by a set of 
reasons and the second place by a ranking of action tokens. 

Nonrelational judgments defined over action tokens are not rela- 
tive to reasons. They are the intentions that accompany our actions. 
If I intentionally do'a rather than b, then my action is accompanied 
by the intention to do a rather than b. Davidson takes this intention 
to be the nonrelational judgment that a is better than b. 

Reasons for relational judgments defined over action tokens 
come in pairs, each containing a relational judgment defined over 
action types and a belief ascribing action tokens to action types. 
Here is an example. On this cold winter night, I am considering 
whether I shall walk my dog or read a book in front of the fireplace. 
On the one hand, I may think it is better to walk my dog than read a 
book, considering my dog needs the exercise. On the other hand, I 
may think it is better to read a book, considering it is so cold out. On 





asa a Cees Dea iy De Kreon for prefening A to Band 
another reason for preferring B to A is slightly confusing. porma 
technical sense do not allow for evaluative conflict of this kind, Davidson appar- 
ently does not carry over this technical sense from ‘preference’ to the related 
verbal form ‘ ing’. I assume that ‘preferring A to B’ is here being used in a 
nontechnical sense is not the kind of evaluative judgment that is at work in a 

ision theory, i.e., it is not synonymous with ‘having a preference for A over B' 
inva decia theoretic % 


ne. 
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a logical analysis, my reason to think it is better to walk my dog than 
read a book is the relational judgment defined over action types: 


(M,) That some action token x is an instance of the action type ‘giving 
proper care to pets’ and some action token y} is an instance of the 
action type ‘neglecting pets’ is a ground to judge that x is better 
than y. 


and the belief: 


(m,) The action token ‘my walking my dog (now)’ is an instance of the 
action type ‘taking proper care of my dog’ and the action token 
‘my reading a book (now)’ is an instance of the action type ‘ne- 
giecting pets’. 


The conclusion of this reason pair is the relational judgment defined 
over action tokens: 


(Cı) (M,) and (m,) are a ground to judge that the action token ‘my 
walking my dog (now)’ is better than the action token ‘my reading 
a book (now)’. 


Analogously, a conflicting relational judgment defined over action 
tokens (C,) can be formed which is relative to the relational judg- 
ment defined over the action types ‘exposing myself to the cold 
weather’ and ‘not exposing myself to the cold weather’ (M,) and the 
matching belief ascribing the action tokens in question to these ac- 
tion types (mg): 


(Cy) (M,) and (m,) are a ground to judge that the action token ‘my 
reading a book (now)' is better than the action token ‘my walking 


my dog (now)’. 


There is a particular relational judgment defined over action to- 
kens in which we take a special interest, namely, the relational judg- 
ment that is relative to all of the agent’s reasons, i.e., all of the 
mental states that the agent takes to be relevant to the choice at 
hand. Such relational judgment is an all-things-considered judg- 
ment. Singular reason pairs may lead to conflicting relational judg- 
ments defined over action tokens. The all-things-considered judg- 
ment is then determined by the relative importance that the agent 
assigns to each reason pair. All-things-considered judgments do not 
follow from the agent’s reasons as a matter of logic. I myself am a 
pet lover and the cold weather does not throw me off that easily. So 
here is my brave all-things-considered judgment: 
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(Ca) Mi, m, Me, and m, are a ground to judge that ‘my walking my 
dog (now)’ is better than ‘my reading a book (now)’. 


Nonetheless, I may stay home and read a book in front of the 
fireplace. How so? Either I may not have formed an all-things-consi- 
dered judgment at all, because I did not give sufficient thought to 
the choice at hand. Or I may have formed such judgment but, 
through weakness of the will, I did not act in accordance with it. In 
both cases, the intention or nonrelational judgment that accompa- 
nies my action is not informed by the all-things-considered judg- 
ment, but rather by the relational judgment (C,), which is relative to 
a subset (containing M, and m,) of the complete set of reasons. 

On Davidson's principle of continence, an action is rational only 
if the intention that accompanies it is informed by the agent’s all- 
things-considered judgment for the choice at hand. Both ill-con- 
sidered as well as akratic actions fail to pass this criterion of rational 
action. 

What can be learned from Davidson’s taxonomy for understand- 
ing the place of preferences in the set of evaluative judgments de- 
fined over alternatives? Let alternatives be state-of-affairs tokens 
(SOA tokens). For instance, my preference for this particular Volvo 
over that particular Austin ts my preference for the SOA token of 
‘my owning this Volvo (now)’ over the SOA token of ‘my owning 
that Austin (now)’. 

Preferences are a special kind of judgment defined over SOA 
tokens in that they do not allow for evaluative conflict. For some 
reasons I may desire the Volvo rather than the Austin and for some 
other reasons I may desire the Austin rather than the Volvo, but I 
cannot have a preference for the Volvo over the Austin and have a 
preference for the Austin over the Volvo. This should ring a bell. I 
propose that preferences, just like intentions, are nonrelational 
judgments. Other evaluative judgments defined over SOA tokens 
are relational judgments, i.e., they are relative to reasons. And this 
parallel can be carried even further. Reasons for relational judg- 
ments defined over SOA tokens come in pairs, each containing a 
relational judgment defined over SOA types and a matching belief 
ascribing SOA tokens to SOA types. For example, a reason for de- 
siring this Volvo rather than that Austin is the relational judgment 
defined over SOA types: 


(Mf) That a SOA token x is an instance of the SOA type ‘owning a 
Swedish car’ and a SOA token y is an instance of the SOA type 
‘owning a British car’ is a ground to judge that x is better than y. 
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and the belief: 


(mf) The SOA token ‘my owning this Volvo (now)’ is an instance of the 
SOA type ‘owning a Swedish car’ and the SOA token ‘my owning 
this Austin (now)’ is an instance of the SOA type ‘owning a British 
Car’. 


This reason pair implies the relational Judgment defined over SOA 
tokens: 


(Cf) (M,) and (m,) are a ground to judge that the SOA token ‘my 
owning this Volvo (now)’ is better than the SOA token ‘my owning 
this Austin (now)’. 


Again, different reason pairs may support conflicting relational 
Judgments defined over SOA tokens. A particular relational judg- 
ment in which we take a special interest is the all-things-considered 
judgment defined over SOA ‘tokens. This judgment is relative to all 
of the person’s reasons, i.e., all of the mental states that the person 
takes to be relevant to the ranking at hand. 

Davidson’s principle of continence imposes a constraint on ac- 
tions. This principle can also be understood to impose a constraint 
on the intentions accompanying the actions. The intention to do a 
rather than b, i.e., the nonrelational Judgment defined over action 
tokens that it is better to do a than b, is rational only if it is informed 
by the all-things-considered judgment for the choice at hand. I pro- 
pose to construct an analogous constraint for preferences: 


(R) A preference for SOA token s over SOA token t, i.e., the nonrela- 
tional judgment defined over SOA tokens that s is better than t, is 
rational only if it is informed by the all-things-considered judg- 
ment for the ranking at hand. 


All-things-considered judgments are relational judgments that are 
relative to all of the person’s reasons, i.e., all of the pai 
containing a relational judgment defined over SOA types and a be- 
lief ascribing SOA tokens to SOA types—which the person takes to 
be relevant to the ranking at hand. Let us name a relational judg- 
ment defined over SOA tokens which is relative to one such reason 
pair a criterial judgment. An example of such criterial judgment is 
(Cf), i.e., on a more colloquial version, my judgment that, consider- 
ing its country of origin, I would rather have the Volvo than the 
Austin. But there may also be other criteria, which the person takes 
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to be relevant to her ranking, say, color, year, etc. The role of the 
all-things-considered judgment is to arbitrate between criterial 
judgments. If all criterial judgments favor the same alternative, ar- 
bitration simply means acknowledging the unanimous vote. If dif- 
ferent criterial judgments favor different alternatives, arbitration 
involves an assessment of the relative importance that each consider- 
ation carries in one’s mental life. I can thus rephrase the constraint 
on preferences: 


R’) A preference for SOA token s over SOA token t is rational only if 
it results from proper arbitration over all criterial judgments de- 
fined over s and t, implied by the reason pairs that the person 
takes to be relevant to the ranking at hand.'® 


How do we go about judging the propriety of such arbitration? 
For our purposes, it is sufficient to see that there are clear cases of 
preferences that are irrational precisely because they did not result 
from proper arbitration over all criterial judgments. For instance, 
suppose I get carried away by the wonderful stereo equipment in the 
Austin. I thus set my preference for the Austin over the Volvo, 
overlooking that the Austin is an older, British car in an ugly ochre. 
On due reflection, I must admit that the consideration that set my 
preference carries much less importance for me than the consider- 
ations that were overridden by it. It can be agreed that such would 
indeed be an irrational preference. 

I shall now argue that this necessary condition on rational prefer- 
ences can solve our puzzle, i.e., can drive a wedge between SG and 
CP. Let alternatives s and ¢ both be feasible alternatives to begin 
with. J have a preference for s over t and this preference is informed 
by an all-things-considered judgment, i.e., it results from proper 
arbitration over all criterial judgments. It now turns out that, for 
some reason, alternative s is no longer feasible anymore. In re- 
sponse, I intentionally adjust my preference such that I come to 
have a preference for t over s. So far, SG and CP run entirely 
parallel. They also run parallel in that I may or may not have first- 
person knowledge of my reasons for adjusting this preference, i.e., I 


18 One may object that no more is required for rational preferences than that 


prominence 

tives in question has in one’s life. This objection is welltaken and can be accom- 
modated for in criteria (R) and (R’) and in the t discussion at a serious 
cost of stylistic simplicity, though without affecting general argument. 
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may or may not know that I resort to this preference adjustment 
because I realize that my favorite alternative is no longer feasible 
anymore. 

The phenomena do differ, however, in the following respect. In a 
typical case of SG, I adjust my preference, though I do not adjust 
any of the relational judgments defined over the SOA types, nor the 
relative importance I attach to each consideration. Assuming fixed 
beliefs, the original set of reason pairs thus remains constant and 
there is no change in the relative importance that each pair carries 
in my mental life. Since the all-things-considered judgment for the 
ranking at hand is relative to this complete set of reasons, it too 
remains fixed over the course of the preference adjustment. In 
other words, nothing changes about the arbitration problem over 
the criterial judgments for the ranking at hand. Hence, the adjusted 
preference cannot be informed by the all-things-considered judg- 
ment, i.e., it cannot result from proper arbitration over all criterial 
judgments. And this is what accounts for the irrational character of 
preferences acquired through SG. 

A typical case of CP is the more involved project in which I adjust 
my preference as well as my reasons for the ranking at hand. That is, 
either I shift around some of the relevant relational judgments de- 
fined over SOA types (which, together with fixed beliefs, imply a 
change in criterial judgments), or I change the relative importance 
that I attach to each consideration. Consequently, relative to my 
adjusted set of reason pairs and the relative importance I assign to 
them, I now think that the still feasible alternative t is better than the 
no longer feasible alternative s. In my adjusted all-things-considered 
judgment, alternative t moves up above alternative s. Hence, over 
the course of the CP move, my preference in flux remains informed 
by the all-things-considered judgment in flux. In other words, my 
preference results at all times from proper arbitration over the rele- 
vant criterial judgments. And this is why there is nothing irrational 
about preferences acquired through CP. 

Let us turn to an example. For quite a while I have been inter- 
ested in a particular new job, which would give me a chance to work 
overtime at a decent pay and with challenging career opportunities. 
I then find out that this change of jobs is no longer feasible for some 
reason. In response, I shift around my preference such that I come 
to prefer my actual job to the new job. 

In a typical case of SG, this preference adjustment is the only 
evaluative shift in my mental life. I still hold the relational judgment 
over SOA types that ‘having a job with opportunities for well-paid 
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overtime work’ is better than ‘having a job which leaves time for 
leisure’ and that ‘having a job with challenging career opportunities’ 
is better than ‘having a more stable and less stressful job’. The set of 
relational judgments defined over SOA types and of beliefs ascrib- 
ing SOA tokens to SOA types (‘my having the new job’ is an instance 
of ‘having a job with opportunities for well-paid overtime work’, 
etc.) still imply a pair of criterial judgments, which cast a unanimous 
vote in favor of the new job. I still am the kind of person who has all 
reason to prefer the new job to my actual job. If I do nonetheless 
have a preference for my actual job over the new job, then my 
preference is not informed by the all-things-considered judgment 
for the ranking at band. In other words, my preference does not 
result from proper arbitration over the relevant criterial judgments, 

which, in this particular case, would be acknowledging their unani- 

mous vote. Consequently, my adjusted preference in favor of my 
actual job can save me from frustration only at the cost of being 
irrational. 

In a typical case of CP, I effect a more radical change in my 
mental life. Not only do I come to hold a preference for my actual 
job, but I also bring about a shift in the set of relational judgments 
defined over SOA types such that, indeed, I become the kind of 
person who genuinely has reason to hold the adjusted preference. I 
now appreciate the charm of a job which allows me more time for 
leisure and in which there is no pressure for promotion. My ad- 
justed preference is thus informed by the all-things-considered 
judgment that is relative to my newly acquired relational judgments 
defined over SOA types together with fixed betiefs. That is, it results 
from proper arbitration over the newly acquired criterial judg- 
ments, which now cast a unanimous vote for my actual job. Hence, 
my adjusted preference in favor of my actual job passes through as a 
fully rational preference. 

What about La Fontaine’s fox? The fox loses his appetite for 
grapes when he finds out that he cannot reach for them. We do not 
assume, however, that he has changed his relational judgment de- 
fined over SOA types that, say, ‘indulging in juicy summer fruits’ is 
better than ‘keeping to a more wholesome diet’. Over the course of 
the SG move, he remained the kind of fox who has a craving for 
juicy summer fruits in general. And what about the poker player? 
The CP poker player did not only change her preference concerning 
fair-play poker in the well-lit Casino, but she also came to hold the 
relational judgment defined over SOA types that ‘playing a fair 
game’ is better, is more fun than ‘cheating’. Over the course of the 
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CP move, she has become the kind of person who genuinely values 
fair play in general.’ 

Both the unsuccessful job hunter and the poker player changed 
around all relevant relational judgments defined over SOA types in 
their CP moves. Notice that CP may well take a more subtle course. 
It may be sufficient to change a proper subset of the relevant rela- 
tional judgment defined over SOA types to effect the desired shift in 
the ranking of the all-things-considered judgment. Or, in case of 
conflicting criterial judgments, it may be sufficient to adjust the 
relative importance attached to the respective considerations to 


have proper arbitration result in the desired preference shift. 
v 


In section I, I considered three responses to our puzzle. I argued 
that, though none of these responses was correct, they were all in 
some way very telling suggestions. A proper solution to our puzzle 
should provide some insight as to why it is that each suggestion does 
indeed seem prima facie plausible. Let us consider whether my 
theory of rational preferences can meet this challenge. 

(a) Between clear-cut cases of SG and CP, there is a broad “gray” 
area of preference adjustments that are undecided as to whether SG 
or CP is the more appropriate classification. Can my theory account 
for this phenomenon? In a typical case of SG, my complete set of 
reasons for the ranking at hand remains untouched over the course 
of the preference adjustment. In a typical case of CP, I adjust both 
this set of reasons and my preference such that the adjusted prefer- 
ence is informed by the all-things-considered judgment that is rela- 
tive to the adjusted set of reasons. In between these typical cases, 
there is a continuum of preference adjustments, in which I actually 
do shift around some relational judgments defined over SOA types 
or make some changes in the relative importance I attach to each 
consideration, though none of this is quite enough to let the balance 
in the all-things-considered judgment tip over in favor of the feasi- 
ble alternative. Some steps are being taken in adjusting the set of 
reasons for the ranking at hand, though not quite enough for the 
adjusted preference to pass through as a rational preference. Be- 
tween typical cases of SG and typical cases of CP there is a contin- 
uum of preference adjustments accompanied by not entirely success- 
ful shifts in the set of reasons for the ranking at hand. 


character, and c) shifts from irrational to rational preferences which are moti- 
vated by feasibility considerations. I intend to address these objections in a later 
paper. 
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Are preference adjustments on this continuum watered-down 
cases of SG or of CP? This is merely a verbal point. On the one 
hand, it seems appropriate to talk about SG for all cases on the 
continuum, because of the irrational character of the thus acquired 
preferences. On the other hand, if a person has made serious 
changes in her relational judgments defined over SOA types, or in 
the relative importance she attaches to the criteria involved, it seems 
appropriate to talk about CP, even though these changes may not be 
quite sufficient to support her preference adjustment. Many real- 
life or fictional cases of preference adjustment are actually located 
on this continuum. They are neither typical cases of SG, nor typical 
cases of CP. My theory shows how there is something to be said for 
classifying such cases in either category. And this accounts for the 
phenomenon that there is indeed a broad range of cases in which we 
are undecided as to whether ‘SG’ or ‘CP’ is the more appropri- 
ate term. 

(b) The SG move typically is a response at the spur of the mo- 
ment, while CP typically is more of a long-term process. If prefer- 
ences are abandoned at the spur of the moment, there is a presump- 
tion of irrationality, which there is not if such occurs gradually. My 
solution to our puzzle can readily account for this presumption. 
While SG targets only one’s preference, CP focuses on the complete 
chunk of one’s mental life which is relevant to the preference in 
question. SG involves one single adjustment, CP involves a project 
of change. A sudden change of preference in response to consider- 
ations of feasibility thus makes us suspicious that SG rather than CP 
is in play. 

(c) In trying to give an account of SG, there is always the tempta- 
tion to call in self-deception, even so if the fox would deny his 
appetite for the grapes upon sincere introspection and would turn 
his head if we were to lower the vine for him. Can my solution 
account for this temptation? 

In attributing a preference to a person we rely on presumptions. 
One such presumption is that, if a person acknowledges a prefer- 
ence upon sincere introspection, then she does actually hold such 
preference. Another presumption is that a person’s choices are in- 
dicative of her actual preferences. Certainly, both presumptions 
may clash and, within particular circumstances, one presumption 
may seem to deserve more credibility than the other. For instance, if 
the controversial choice is made under stress, then we tend to put 
more trust in the introspective presumption. On the other hand, if 
acknowledging some preference is threatening to one’s self-image, 
then we tend to put more trust in the behavioral presumption. 
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There is also a third presumption that comes into play, namely, a 
principle of charity in practical reasoning™: if a person holds a set of 
reasons which makes it rational for her to hold one preference 
rather than some other, then there is a presumption that she actu- 
ally does hold this preference. In claiming that the fox actually did 
change his preference, this presumption of rationality is not re- 
spected. On the other hand, if we were to claim that the fox did not 
change his preference, then we would not respect both the intro- 
spective and the behavioral presumption. 

I do not want to suggest that the number of presumptions can tip 
the scale. Rather, I propose to restrict the term ‘SG’ to those cir- 
cumstances in which we tend to put our trust into the introspective 
and the behavioral presumption, rather than into the presumption 
of rationality. But even though one may feel most at ease in trusting 
the introspective and the behavioral presumption, the temptation 
sull lingers to interpret the phenomenon in reference to a principle 
of charity in practical reasoning, i.e., to deny that any preference 
adjustment has taken place. If this temptation were to be followed, 
one would need to take recourse to self-deception to make up for 
not respecting the introspective presumption. Hence, my theory of 
rational preferences drives a wedge between SG and CP without 
describing the former phenomenon in terms of self-deception. At 
the same time, it can explain why there is a prima facie plausibility to 
such description. 

LUC BOVENS 
University of Colorado/Boulder 


™ Davidson, op. cit., pp. 221-2; and “Paradoxes of Irrationality,” in R. Woll- 
beim and J. Hopkins, Philosophical Essays on Freud (New York: Cam- 
bridge, 1982), pp. 289-305. 
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LOOPHOLES IN MORALITIES* 


ORALITIES, in one sense of this word, are abstract 

standpoints outside the world from which what occurs 

within it—actions, persons, institutions, states of affairs 
—can be evaluated. In another sense, moralities are themselves in 
the world: more or less unified sets of moral beliefs and attitudes 
held by particular persons or groups. In this second sense, morali- 
ties have real effects; and these may themselves be made a subject of 
evaluation. Such a perspective upon conduct-guiding structures of 
values and norms—or codes—is familiar to economists and legisla- 
tors, who take social effects into account in designing or evaluating a 
tax code, for example. My aim is to extend this perspective to both 
key domains of morality: to ethics (concerned with the moral evalua- 
tion of conduct and character, with the life one should lead and the 
person one should strive to be) and to justice (concerned with the 
moral evaluation of social institutions that regulate and pattern hu- 
man interactions). My general question is then: What role, if any, 
should an evaluation of the effects of moral codes play in our sub- 
stantive moral reflection?’ 

Clearly, we should take account of such effects only if their con- 
nection to the relevant code is reasonably firm. It is not rational to 
worry about any fortuitous effects a morality (in the second sense) 
may have had in the past, if they tell us nothing about what effects it 
is likely to have in the future. Taking account of the differential 
effects of alternative moralities makes sense only insofar as we can 
estimate these effects in advance. My question then concerns effects 


unlike the law, they are not codified, nor officially adjudicated and enforced. We 
live, one might say, in a world without moral codes. But then my talk of moral 
codes is a response to a theoretical need rather than to an empirical reality. We 


can fix upon some rather determinate specimen to which this analysis can be, 
somewhat rigorously, applied. Talk of moral codes then reflects an idealization. I 


ourselves, if our moral beliefs—though amorphous and diverse—are sufficiently 
similar in content and effects to the h moral code of the example. 
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of moral conceptions which are predictable at least in a rough proba- 
bilistic way. 

This may be reminiscent of Nietzsche’s question: What is the value 
of our morality? Yet I am not judging a morality by some novel, 
alien measure of value, but by its own standards: Have we organized 
our moral commitments in a way that reflects, and helps effectively 
achieve, what by their own lights matters? My goal is to uncover new 
kinds of considerations of relevance to our moral reflection and 
discourse, which may help break the stalemate of conflicting appeals 
to moral intuitions and thereby make moral dialogue more fruit- 
ful, moral convergence more likely, and our moral values more 
effective. 

To keep my topic manageable, I confine myself here to one class 
of predictable effects, namely, to cases where a code is counterpro- 
ductive by giving incentives toward conduct that is regrettable by 
the code’s own lights.” In saying that a moral code can be counter- 
productive, I am assuming that it is understood as having some 
point, rationale, purposes, or objectives. This is the typical case in 
the modern world. But it is also possible, of course, to think of a 
moral code as the moral truth or God’s law. We can then think of it 
as not being in the business of producing anything, and hence as 
incapable of being counterproductive. On this understanding, we 
would not have second thoughts about our morality even if our 
holding it predictably led to the extinction of the human species or 
the disappearance of this morality itself. 

That a code gives someone an incentive to act in a certain way 
means only that it gives her a reason to act in this way. Such a reason 
for conduct need not affect conduct itself: a code can encourage 
conduct without triggering or influencing conduct.’ I count both 
the incentives a code gives and any conduct these may elicit as ef- 
fects of this code. My general question has thus become more spe- 
cific: Can the fact that a morality gives incentives toward regrettable 
conduct constitute at least some reason to revise this morality? 

I believe so. But let me add three cautionary notes to this claim. 
First, the fact that a code gives an incentive toward conduct that in 

? I shall skip the ‘by the code’s own lights’ wherever possible. But it must 
be kept in mnd we are throughout ing with counterproductivity, i.e., 
with cases where moralities undermine their own effectiveness or, as it were, 
regret their own effects. 

* For example, a tax deduction for charitable gifts provides an incentive to 
many taxpayers. But for some of these, the incentive may not tip the 
are more inclined than they would otherwise be, but still not inclined enough, to 
give. And others may not know about the tax deduction or may not realize how 
they can take advan of it. Codes may then give incentives that do not affect 
conduct at all. As I show, some such incentives are problematic nevertheless. 
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some instances is regrettable in some respect does not automatically 
render this code counterproductive tout court. The conduct in ques- 
tion may be (by the lights of the same code) valuable in other in- 
stances or in other respects and thus not be regrettable on the 
whole. Alternatively, the overall point, rationale, purposes, or ob- 
jectives of the code may demand that it give the relevant incentive, 
or that it have certain features of which this incentive is an unavoid- 
able byproduct, despite the fact that the incentive is also regrettable, 
even on the whole. The problems I want to examine are raised only 
when a code is counterproductive all things considered, i.e., when 
its overall point, rationale, purposes, or objectives would be better 
fulfilled if this code were replaced by a somewhat different code. I 
therefore assume in my more abstract discussions, or seek to show in 
my more concrete examples, that there really are no redeeming 
virtues that would allow one to say that the moral code under dis- 
cussion—though it gives regrettable incentives—is not counterpro- 
ductive all things considered. 

Second, the effects of a code that is prevalent in some society 
depend to some extent on the life context of this society (including 
its culture, technology, and natural environment). Likewise, the ef- 
fects of a code held by an individual are shaped by the life context of 
that particular person.* It may then sometimes be possible to reduce 
or eliminate the regrettable effects of a moral code through deliber- 
ate modifications of the life context in which it is held. There are 
other cases, however, where such strategies are either unavailable or 
themselves very costly in moral terms. 

Thus, my claim specifically targets cases where a morality is coun- 
terproductive all things considered in virtue of giving regrettable 
incentives that are not reasonably avoidable. In such cases we have 
reason to revise our morality. But—and this is the third cautionary 
note—such a reason may not be decisive. I do not claim, by any 
means, that the effects of moralities are all that counts in moral 
reflection. Only that they are important, and decisive in some cases. 

I. A CRUCIAL DISTINCTION 
It may seem that an incentive given by a code can be problematic 
only insofar as it actually elicits regrettable conduct. This view may 
indeed be plausible as regards concrete incentives, i.e., reasons for 
conduct which a code gives to fully specific agents who actually do 

4 These two paradigm cases do not exhaust the realm of possibilities, A moral 
code may be socially prevalent in a small social system, may held by a minority 
within a society (with the majority perha committed to some other code), may 
be held by an individual who is highly i tial in a society (the President of the 
U.S.A), and so forth. In all these cases, the effects of the code are shaped by the 
life context in which it is operative. 
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or will exist (or to statistical aggregates of such agents, such as the 
French taxpayer). But one can speak of incentives in another sense 
as well. Ideal incentives are encouragements or discouragements 
that a code, as it were, “wants” or “intends” to give (which does not 
mean that they must have been thought of, let alone intended, by 
some creator(s) of it). Leas metaphorically, ideal incentives are rea- 
sons for conduct which a code would provide to persons in the given 
life context, if they were ideal adherents, i.e., agents who are fully 
rational, who fully understand the code and interpret it correctly, 
who are fully committed to comply with any requirements of the 
code, and whose circumstances and remaining interests fall within 
the normal range. Insofar as a code provides ideal incentives, we can 
say that it guides persons toward certain conduct (or misguides 
them, if this conduct is regrettable on the whole). 

Ideal incentives are of two kinds. With compliance incentives, 
ideal adherents are motivated solely by their commitment to do what 
the code requires or suggests. Incentives deriving from the code’s 
requirements are always decisive for an ideal adherent (assuming the 
code is consistent), while incentives deriving from its suggestions 
need not be. With reward incentives, ideal adherents are motivated 
by other, code-independent interests of theirs insofar as these can 
be pursued without violating requirements of the code. Here a code 
encourages conduct by affecting the official pay-offs: eligibility for 
benefits or liability to burdens. Reward incentives need not be 


Two examples: a tax code requiring that one pay taxes on income 
from nonmunicipal bonds provides a compliance incentive to pay 
taxes on such income, and a reward incentive to invest in municipal 
rather than nonmunicipal bonds; a morality that requires single 
young adults to volunteer for a stint in the military provides a 
compliance incentive to enlist, and a reward incentive to get mar- 
ried early.’ 

Such ideal incentives may manifest themselves in concrete incen- 
tives (and thereby possibly in conduct as well) if ideal adherents of 
the code or persons sufficiently similar to them actually exist. But 
ideal incentives may also fail so to manifest themselves: neither the 
compliance incentives nor the reward incentives in our two exam- 
ples would manifest themselves as concrete incentives to persons 


*I am here assuming that ideal adherents, having normal interests, care more 
about enhancing their own income than the government’s and would also rather 
avoid enlistment. In the second example, serving in the military is morally re- 
quired and legally voluntary. 
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who do not know the relevant parts of the codes or do not care 
whether they are in compliance or not.® 

As a code may provide ideal incentives that do not manifest them- 
selves as concrete incentives, it may also give concrete incentives 
that do not correspond to any ideal incentives it provides. Thus, a 
tax law may incline certain persons to keep their fortune abroad 
(where it can more easily be hidden from the tax authorities). And 
our ethic may incline certain persons to pretend to be needy (in 
order fraudulently to exploit the ethical command to help the 
needy). In these cases, a code gives persons reasons to transgress a 
requirement of this code, which would not be reasons for them if 
they were ideal adherents of it. 

With this distinction between the ideal and the concrete sense, 
our more specific question splits into two. One is the ideal question: 
May we have reason to revise our morality if, in the given life con- 
text, it provides ideal incentives that are regrettable by its own 
lights? The other is the concrete question: May we have reason to 
revise our morality if, in the given life context, it provides concrete 
incentives that are regrettable by its own lights? This essay focuses 
exclusively on the former question.’ 

II LOOPHOLES 

Even if they affect no one’s conduct, regrettable ideal incentives can 
be problematic by revealing a structural flaw in the code providing 
them. Suppose some tax code provides a compliance or reward in- 
centive that would induce some taxpayers, if they were ideal adher- 
ents, to act in ways that directly undermine a central purpose of this 
code. The code may then be criticized for containing a loophole and 
thus being in a sense incoherent. And this critique can be sustained 
even if corresponding concrete incentives do not exist or are not 
noticed, so that the loophole does not actually elicit any regrettable 
conduct at all. 

A loophole exists only if the connection between the code and the 
regrettability of the conduct it encourages is tight in two respects. 
First, the relevant incentive must be an ideal one, so that the code 
can be said to guide agents toward the regrettable conduct. Second, 


6 By providing ideal incentives, a code guides (or misguides) ts. Like a 
traffic sign, it can do so without affecting their conduct: it is that agents 
differ from ideal adherents, so that the code’s ideal incentives do not manifest 
themselves as concrete incentives to them. And it is possible that agents, however 
similar to ideal adherents, disregard reward incentives or compliance incentives 
deriving from the code’s ons. 

7 Some prelimi thoughts on the latter question are presented in my ‘“The 
Effects of t Moral ceptions,” in Social Research, LVU, 8 (Fall 1990): 
649-63. 
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the conduct it encourages must be regrettable in itself rather than 
in virtue of any further effects it may bring about, however predict- 
ably. So, through loopholes agents are guided toward conduct that 
is in itself regrettable on the whole. 

Conduct is regrettable in itself just in case it goes against what the 
code requires, or essentially aims at outcomes that are (by the code’s 
own lights) regrettable on the whole. By definition of “ideal adher- 
ent,” a consistent code will not provide ideal incentives toward con- 
duct that is in the first way regrettable in itself—it will not, that is, 
guide agents toward conduct that it also forbids. The topic of loop- 
holes thus boils down to cases where a code guides agents toward 
conduct that essentially aims at outcomes that are regrettable on the 
whole. I call such ideal incentives inherently regrettable. 

OI. SOCIAL ARRANGEMENTS 

To illustrate the problem, let me here concentrate on moral loop- 
holes involving social arrangements, by which I mean relationships, 
associations, or social systems, such as marriages, lawyer-client rela- 
tionships, families, firms, churches, and states. Social arrangements 
tie their members together through normative expectations that are 
based upon special ties, including moral ties, that define special 
rights and obligations, powers and responsibilities. Such special 
moral ties vary in formality—ranging from the justiciable and en- 
forced explicit provisions of a legal code to the vague understand- 
ings that give rise to moral expectations (and disappointments) 
among friends or compatriots. 

Consider the ethical view that as a member of a social arrange- 
ment one may sometimes (when acting in behalf of other members 
or of the entire group) deliberately harm outsiders in some specific 
way, even while one may not do so when one is acting on one’s own. 
Such views are, I believe, actually held. In the business world, those 
who implement a corporate policy that is harmful to consumers, 
employees, or the general public often stress their status as man- 
agers and their obligations toward the firm’s owners, whose finan- 
cial interests they were hired to promote. How is this supposed to be 
ethically relevant? Such a manager is not trying to pass the buck by 
suggesting that the owners, too, must bear some of the responsibility 
for the offensive policy and that he therefore bears less. What he 
implies, rather, is this: thanks to the differentiation of roles between 
owners and manager, the policy in question can be implemented 
without wrongdoing—though this would not be the case if he were 
managing his own firm so as to promote his own financial interests.’ 


ma aay in the academic world, university presidents and trustees have some- 
times defended their decision to invest part of their institution’s endowment in 
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Ethical views of this sort guide their adherents to form or join 
social arrangements in order to effect, through the special ties these 
involve, a unilateral reduction of responsibility toward those left out 
of these arrangements. Such guidance is inherently regrettable— 
not, of course, because additional social arrangements may be 
formed, but because harms that are straightforwardly morally pre- 
cluded can be indirectly brought about after all. Let us consider two 


examples in some detail. 
IV. CASE 1 


Your father owns a large apartment complex. Renting the apart- 
ments as they are is profitable, but market conditions are such that it 
will soon become preferable to convert them into luxury accommo- 
dations while doubling the rents. Your father knows, however, that 
most of the apartments are occupied by elderly tenants who bave 
lived there for decades. Few, if any, of them could afford to stay. 
The others would have a hard time finding new homes. More impor- 
tantly, they would be scattered all over town and thus would lose the 
companionship they have found with one another—the greatest 
source of pleasure in their final years. Your father understands that 
it would be wrong of him to force the tenants out. But he also 
resents the idea of giving up the extra income. It occurs to him that 
many landlords like himself employ lawyers to manage their real 
estate in the owners’ best interest. So your father hires a lawyer as 
well and instructs her to manage his real estate in a professional way. 
As was to be expected, the lawyer eventually initiates the conversion 
of the complex. Rents double and the old tenants are scattered all 
over town. 

The assumptions I am making about the prevalent ethical code, E, 
are indicated in the top row of the following table: 


Option Al Option A2 Option B 
(aggregate) moral status: permitted forbidden permitted 
aggregate nonmoral cost: high Zero negligible 
E forbids your father to initiate the conversion on his own (A2). As 
for the remaining two options, E does not require (though it may 
suggest) abandonment of the conversion (A1) in preference to the 
lawyer’s initiation of the conversion in the owner’s behalf (B). A 





ir fiduciary obligations toward the 
ee ee (e.g., in South African 
goldmines or in certain arms-export ) can be ethically acceptable when it is 
made on behalf of the university even while it would not be ethical if it were made 
with personal funds. 
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landlord is not required to be personally involved in the manage- 
ment of his property. He may surely entrust his real estate to a 
lawyer. And such a lawyer is not required to refrain from the con- 
version. She has a professional responsibility to her client, and there- 
fore may (or even should) administer property entrusted to her in 
the client’s best interest. It is simply not her role to ponder whether 
her client is not rich enough as it is, and should not therefore forego 
some benefit in order to preserve the tenants’ community and form 
of life.® 

An ethic incorporating these beliefs provides an ideal incentive to 
hire the lawyer.’° This incentive is inherently regrettable, provided 
the responsibility of the “solitary” landlord is conceived as, to some 
extent at least, a responsibility toward Ais tenants, and the wrong he 
would commit by converting the building (A2) as a wrong done to 
them. J assume here that one main rationale for E's forbidding such 
single-handed conversions is to protect the tenants’ community 
against deliberate destruction for the sake of greater landlord 
profits. E itself suggests this rationale if, for example, its evaluation 
of the single-handed conversion varies with the extent to which it 
would foreseeably harm the tenants. Insofar as this rationale is pres- 
ent, the incentive in case 1 indicates a loophole: E guides the land- 
lord toward the profitable conversion by enabling him to render it 
free of wrongdoing. Yet, according to E's rationale, as implicitly 
suggested in Æ, this conversion is regrettable on the whole: the ten- 
ants’ suffering is not outweighed by the landlord’s extra income. E's 
own rationale thus gives a reason for closing the loophole—for 
blocking the opportunity unilaterally to reduce aggregate responsi- 
bility. 

This does not mean, of course, that an acceptable ethic must 
require a lawyer to give no more weight to her client’s interests than 
to those of other persons. It is not objectionable that persons may 
form or join social arrangements whose members agree to owe one 
another much more than persons owe one another in general as a 
matter of common decency. So I have no quarrel with the idea that 


° Tt 1s important that the lawyer, too, is in compliance with what E requires (that 
the aggregate moral status of option (B) is permitted”). I am not interested here 
in the opportunity, which some ethical codes may afford, of reducing one’s own 
wrongdoing (one’s share of responsibility for some transaction) by partly shifting 
it off to another. aoe 

19 More precisely, such an ethic—as sketched for example in Charles Fried, 
“The Lawyer as Friend: The Moral Foundations of the Lawyer-Client Relation,” 
in Yale Law Journal, LXXXV (1976)—gives landlords an ideal compliance 
incentive to prefer (B) over (A?) and an ideal reward incentive to prefer (B) 
over (Al). 
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persons, by becoming members of a social arrangement, may in- 
crease what they owe the other members and may thus come to owe 
them more than they owe all outsiders. What I find problematic is 
the idea that persons, by so increasing what they owe certain others, 
may reduce what they minimally owe everyone else. I say minimally, 
because an ethic may of course allow persons to reduce what they 
owe others; it may, for instance, allow one to get a divorce through 
which one reduces what one owes to the person one had married. So 
my point concerns only those basics which persons owe all others as 
a matter of common decency. However an ethic may fix these mini- 
mal general demands, it should not enable persons unilaterally to 
reduce or dilute them. Specifically, it should not allow them, by 
forming or joming a social arrangement, to subject themselves to 
new, countervailing obligations to its other members which may out- 
weigh, trump, or cancel their minimal obligations to everyone else. 
Forming or joining a social arrangement should not have that sort 
of moral efficacy, should not introduce that sort of moral change. It 
should not be among the legitimate benefits of forming or joining a 
social arrangement that one thereby weakens the minimal moral 
protections that those who are now outsiders enjoy against oneself. 

Even this narrow demand is defeasible because an incentive— 
even though it depends on the opportunity unilaterally to sidestep 
aggregate responsibility for given barms—can be desirable in an- 
other respect and so not regrettable on the whole. About case 1, for 
example, someone might object that I have underdescribed E's 
rationale: there is a point after all to the incentive E provides, 
namely, the eminent moral importance of the hiring of lawyers. It 
matters more, morally, that landlords should hire lawyers than that 
they and their lawyers should not knowingly destroy their tenants’ 
community for extra profit. This objection marks a weak spot in my 
entire approach, because, quite generally, the claim to have found a 
loophole can always be countered by claiming that the relevant ideal 
incentive is not in fact regrettable on the whole. 

Stil, I am not too worried about this weak spot, because what 
counts is not making a counterclaim but making it plausible. To see 
whether it is plausible, we must examine whether the code overall 
really reflects the suggested rationale for the questionable incentive. 
If it were part of Fs rationale that additional hiring of lawyers is 
morally important, then this value ought to surface throughout the 
code. If there are no other indications of such a value in the code, 
then the conjecture proposed by the objector is implausible. 

Insofar as E is merely a creature of my own imagination, there is 
no point in conducting such an examination for case 1. But insofar 
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as E is meant to be a realistic code that may actually be held in our 
society, we can dismiss the counterclaim. For no one (except a few 
lawyers?) seriously believes it to be morally important that there be 
more hiring of lawyers. 

I conclude that, if E prohibits self-initiated conversions at least 
partly on account of the damage they would do to the tenants’ form 
of life, for the sake of greater landlord profits, then any ideal incen- 
tive E provides toward such hiring of lawyers indicates a loophole. 
Such an incentive manifests a kind of incoherence: the point of 
forbidding a given deliberately imposed correlation of harms and 
benefits in one context is defeated, for no compelling reason, when 
this prohibition can be avoided by switching to another context in 
which the same deliberate imposition of the same correlation of 
harms and benefits is permitted.!! To avoid this incoherence, E must 
not allow aggregate responsibility for the conversion, or aggregate 
wrongdoing involved in initiating the conversion, to be so signifi- 
cantly reduced by the hiring of a lawyer.!” 

V. CASE 2 

The year is 2100. Like their ancestors, South Africa’s whites are 
quite affluent thanks to an abundance of black workers available at 
subsistence wages, who now, however, are nationals of sovereign 
states. This change was instituted by an earlier generation of South 
African whites, who intended that their progeny should not only 
(like themselves) be economically advantaged but should also be free 
from any special responsibility for the fate of blacks in the region. 
The former white leaders (in the 1980s) keenly appreciated that the 
prevailing conception of justice, J, allows even its minimal con- 
straints on economic relations to be undermined through the inter- 
Position of national borders. And so they made sure the blacks of 
the twenty-first century would be foreigners. 

The incentive operative here is again a moral incentive, in that the 
creation of the homelands is motivated by a feature of J: by the 
weight J gives, in its assessment of economic relations, to the exis- 
tence of states and of ties among compatriots. It corresponds to an 

1 The phrase ‘for no compelli reason’ alludes to the possibility of plausible 
counterclaims. If additional hiri of lawyers were morally important, and if the 


incentive of case 1 were a good way of promoting this important value, then there 
ight be a compelling reason for the Incentive. 


yo 
tions VI and VIJ) that the problem of loo es requires a far deeper revision of 


E. Moreover, I have speci chosen È for the purpose of giving a plausible 
illustration of a constraint that I believe should be im n any moral 
conception As will emerge, this constraint poses a to other 


moral views that are commonly held. 
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ideal (compliance) incentive, because the reform instituted by the 
former white leaders really relieves the injustice (in terms of J) in 
the relations between whites and blacks in southern Africa, and thus 
would matter even to persons who are genuinely committed to J and 
aware of all relevant facts. Finally, insofar as J condemns the earlier 
pattern (prevailing in the 1970s) on account of the disadvantaged 
position of blacks, the incentive it provides is also inherently regret- 
table: through the encouraged “reform,” those disadvantages get 
preserved, even entrenched, rather than eradicated. 

A conception of justice providing this incentive would, I think, be 
incoherent in a sense. J strongly disapproves of the pattern that 
existed in the old South Africa (of the 1970s) as violating basic 
justice: blacks are, by law or convention, barred from the more 
lucrative positions in their economy; they compete for the remain- 
ing positions at a great disadvantage due to grossly inferior school- 
ing, discriminatory hiring and promotion practices, legal prohibi- 
tions against residing in certain areas, and so on. J then approves of 
the ‘‘reformed”’ pattern, however, in which all the features and rela- 
tions relevant to condemning the old pattern as grossly unjust con- 
tinue unaltered, the only change being that the blacks are now na- 
tionals of sovereign states. And what is worse, J even encourages 
such “reform” by offering that attractive opportunity to reduce in- 
justice without foregoing its benefits. Of course, J is not strictly 
inconsistent. But it does not make sense to let the same inequalities 
continue in the context of a slightly changed pattern that can so 
easily be reached from the old pattern through a (by the lights of J) 
merely cosmetic reform, which leaves intact all advantages, disad- 
vantages, and correlations between them. 

For all I have said, an acceptable conception of justice may allow 
persons to form a state, to draw borders around it, and to exclude 
other persons from it. It may also allow that compatriots attempt— 
solely among themselves—to meet a standard of fair cooperation 
that is much more stringent than whatever minimal constraints (of 
“basic justice”) it may impose on cooperation in general. So, an 
acceptable conception of justice may allow the creation of states to 
lead to a higher standard and to a double standard. What it should 
not allow is that persons, by making others into foreigners, can 
dilute or evade even its minimal standard, which would otherwise 
govern relations between them. I say ‘minimal’, because I am not 
here worried about a conception of justice that allows a group to 
evade a higher standard, e.g., by seceding from a state that is more 
egalitarian than this conception minimally requires. My pomt con- 
cerns only whatever minimal standard a conception of justice ap- 
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plies to any pattern of interaction—even in the absence of commu- 
nal bonds, fellow feeling, a common history, shared projects, and so 
forth. However such a conception may fix this standard of basic 
justice, it should not allow groups unilaterally to circumvent or di- 
lute it. Specifically, it should not allow such a group, by forming a 
social arrangement within the larger pattern of interaction, to bring 
a new moral standard into play which may outweigh, trump, or can- 
cel the minimal protections justice requires for those left out of this 
arrangement.’* The introduction of states should not have that sort 
of moral efficacy, should not produce that sort of moral change. A 
conception of justice should not guide persons toward, and reward 
them for, deliberately creating borders in order to “just-ify” a given 
correlation of harms and benefits, i.e., in order to make it compati- 
ble with the requirements of basic justice without materially chang- 
ing it. 

Here it is possible, once again, to make a counterclaim: it is re- 
grettable that the old white leaders, by instituting the homelands, 
are able morally to avoid a genuinely reformed regime under which 
blacks would have attained legal, political, and economic equality. 
But this is not regrettable on the whole, because J’s rationale also 
includes the view that the multiplication of states has great moral 
value that here outweighs the value of nonexploitation. We can 
leave this counterclaim aside as implausible: no such rationale is 
reflected in the conceptions of justice that are seriously held in our 
society. 

This concludes the illustration of my antiloophole constraint: a 
code of ethics or justice must not without compelling reason encour- 
age changes through which its own minimal standard of common 
decency or basic justice can be deliberately undermined through 
the creation of social arrangements and the special moral ties these 
involve. 

VI AN OBJECTION 
There is a way of defending the belief in the moral efficacy of creat- 
ing social arrangements. E and J do not generate the offending 
incentive, one might claim, so long as they forbid forming or joining 
social arrangements in such cases.’ In this way one can preserve the 


"? Once national borders have been drawn in southern Africa, it will, with time, 
come to seem obvious that the white state may be o ized democratically so that 
its government will have strong incentives to promote the interests of 
its dtizens against the (however disadvan people of neighboring states, who, 
after all, have their own (however impotent) governments to take care of their 


interests. 
14 Even if E and J are understood m this way, they could still be attacked for 
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judgments that the lawyer-initiated conversion of the apartment 
complex is not wrong, and that economic relations in southern 
Africa are not unjust once the homelands have been introduced: it 
was wrong to create the relevant social arrangements with the mo- 
tives in question, but their creation nevertheless has the intended 
moral effect. E and J could then hold the creators of the social 
arrangements—those who hire lawyers or institute national borders 
in cases like 1 and 2— fully responsible for the harms inflicted upon 
outsiders. 

I consider this defense implausible on three counts. First, the 
proposed moral codes would provide no reasons to derail harms 
that are yet in the future. Thus, suppose your father dies before the 
conversion is initiated and you inherit the complex with the lawyer 
already in charge of it. Since what is wrong is only the hiring of the 
lawyer (spilt milk) and not her continued efforts on your behalf, 
neither you nor she have any moral reason to oppose the conversion 
—even though it will bring about serious harms in which you both 
are essentially involved. You and the lawyer are not just bystanders 
who are merely failing to prevent harms done by another: at least 
one of you (and no one else) is intending to increase the rental 
income, deciding that and how the conversion should take place, 
increasing the rent to its new level, evicting the tenants who cannot 
pay, and profiting from the conversion. An analogous point applies 
in case 2. 

Implausible is also, secondly, the idea of saddlmg the creators 
with open-ended posthumous responsibility. After all, the harms 
they are held responsible for (unlike future catastrophes caused by 
sloppy hazardous-waste disposal) do not arise from their conduct 
directly. Rather, these harms are, in case 1, consciously brought 
about by the lawyer who, in tum, serves at the pleasure of the 
current owner; and, in case 2, arise in the context of a pattern that is 
deliberately perpetuated by later whites. 

Third, and most importantly, it is unclear how it can be so wrong 
for your father to create the agent-client team, given that, as the 
objection contends, this team does nothing wrong. And similarly for 
the old white leaders who initiate a pattern of interaction that, by 
hypothesis, is not unjust. Here it even seems that, if anything, we 
should have to congratulate these leaders for having replaced the 





P regrettable concreis incentives. But the complaint that they contain loop- 
les is now blocked, for ideal adherents of E and f would not implement either 
your father’s idea or the homelands’ reform, if doing so were forbidden. 
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former unjust pattern by the “reformed” one—exactly as we would 
have congratulated them for creating a new, just pattern that does 
not disadvantage blacks. 

For these three reasons, we should disqualify a revision of E that 
meets the antiloophole constraint by allocating responsibility for the 
suffering of the tenants to the creator of the landlord-lawyer team 
while assigning no responsibility to this team itself. The team’s ag- 
gregate wrongdoing in initiating the conversion must not be leas 
(though it may of course be greater) than the solitary landlord’s 
wrongdoing in initiating the same conversion singlehandedly.” Simi- 
larly, we should disqualify a revision of J that meets my antiloophole 
constraint by allocating responsibility for the suffering of blacks to 
those who internationalized the pattern while finding no fault with 
the “reformed” pattern itself, nor with those who maintain it in 
existence. 

The general point is that, when matters of common decency or 
basic justice are at stake, a morality must not be sensitive to changes 
that are consciously instituted in order to secure a more favorable 
evaluation, but are (by the lights of this morality itself) merely 
cosmetic. 

VIL STRENGTHENING 

Let me proceed by sketching—again for the topic of social arrange- 
ments—a way of strengthening this initial thesis. Suppose, in varia- 
tion of case 1, that we find new evidence about your late father’s 
motivations. Although he might have hired the lawyer in order to 
enable you to reap the benefits of the conversion without wrongdo- 
ing, he actually did this because he thought you would not cope well 
with the requisite paperwork. Would this finding make it permissi- 
ble, or more permissible, for you and the lawyer to convert the 
complex? Or suppose, in a similar variation of case 2, that we are 
ignorant of why the old white leaders (of the 1980s) instituted the 
homelands. Does such ignorance make it impossible to evaluate the 
justice of the economic relations that prevail in southern Africa 
more than a century later? 

This would be quite strange. For the creators of the relevant so- 
cial arrangements are not directly involved in the production of the 
harms that are currently inflicted upon tenants and blacks, Their 
motives thus seem irrelevant to the question what moral protections 
these victims should enjoy against the landlord-lawyer team or 


the team and its creator, I am not considering in this essay bow responsibility 
should be distributed within that team. 
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against the present whites who are upholding the multinational eco- 
nomic regime in southern Africa of 2100. 

This consideration shows that it is implausible to close the loop- 
holes by the narrowest revision of E and J, which would cancel the 
moral efficacy of creating (or joining) a social arrangement if and 
only if this creation was actually motivated by the offending incen- 
tive. But if information about why a social arrangement was origi- 
nally created or joined is irrelevant, then the loopholes we have 
analyzed show more than one might have thought at first. They 
indicate not merely that the moralities in question are overly sensi- 
tive to certain changes that are, and are meant to be, merely cos- 
metic; but, more generally, that these moralities are overly sensitive 
to certain merely cosmetic differences. The loopholes indicate that 
E and J overstate not merely the moral efficacy of creating social 
arrangements, but, more generally, the moral significance of such 
arrangements themselves. 

To clarify this point, let me go through the strengthening move 
once more in a tax-code context. Suppose a tax code contains the 
provision that firms whose name begins with a ‘P’ shall be exempt 
from a certain tax. Assuming that firms can choose to name or 
rename themselves as they please, and assuming also that no serious 
purpose is served by bringing the capital ‘P’ to social prominence, 
this exemption constitutes a straightforward loophole; it provides an 
inherently regrettable ideal reward incentive. This incentive can be 
eradicated by excluding from the scope of the exemption any firms 
that deliberately name or rename themselves in order to reduce 
their tax liability. But, on reflection, this revision seems implausibly 
narrow: Why should the tax treatment of Pacific Ice-Cream (affect- 
ing its current owners) hinge on the motives of those who, perhaps 
quite some time ago, chose this name for the firm? Why should it 
hinge on any historical facts about its naming? Inherently regret- 
table ideal incentives may thus indicate the need for a broader revi- 
sion of the relevant code—one that eradicates the code’s sensitivity 
to the cosmetic difference, rather than only the opportunity to ex- 
ploit this sensitivity through cosmetic changes. 

Hence my hypothesis: a loophole in a morality supports the con- 
clusion that this morality is sensitive to a merely cosmetic difference. 
In light of case 1, we should thus question quite generally why the 
minimal moral concer tenants are owed should vary with whether 
their building is managed by a lawyer in the owner's behalf or by the 
owner himself. And case 2 should lead us to question quite generally 
why national borders should make a difference to what standards an 
economic regime must meet as a matter of basic justice. 
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We can test this hypothesis by examining two argumentative strat- 
egies that should work if this hypothesis is correct. In the next two 
sections, I shall try to make plausible that these two strategies do 
indeed have some appeal, and thereby, I hope, reinforce my hy- 
pothesis. 

VI. FICTIONAL HISTORIES 

Arguments of the first strategy consist of two steps that correspond, 
respectively, to my initial thesis and the strengthening move. Imag- 
ine someone who believes there is nothing wrong with a lawyer’s 
converting her client’s apartment complex in his behalf, even 
though it would be wrong for the landlord to initiate this conversion 
on his own. The first step in challenging this belief consists in con- 
structing a fictional history of the existing landlord-lawyer relation- 
ship which differs from the actual history by taking the creation of 
this relationship to have been motivated by the desire to exploit 
precisely the moral differential that our interlocutor is postulating. 
This fictional history may recount, for example, how the landlord’s 
father had once hired the lawyer in order to make possible a conver- 
sion without wrongdoing. Through use of arguments from sections 
IV and VI, we then try to get our interlocutor to concede that an 
acceptable morality must not encourage that fictional conduct, nor 
allow it to succeed. That is, in the fictional history the hiring of the 
lawyer fails to enable a conversion without wrongdoing. 

To transform this conclusion about a fictional case into a conclu- 
sion about the actual case, we must draw upon the strengthening 
move. The second step consists, then, in challenging the moral rele- 
vance of the difference between the actual and the fictional history: 
Why should the fact that our landlord hired the lawyer from quite 
harmless motives (perhaps because he wanted to live in another 
town) entail that he is morally entitled to reap the extra profits at the 
tenants’ expense which he would not be morally entitled to reap in 
the fictional scenario? And why should this same fact entail that the 
tenants are not morally entitled to the preservation of their commu- 
nity whereas they would be otherwise? F ailing a convincing reply to 
these questions, we are bound to conclude that the conversion with 
its actual history is wrong if the conversion with the fictional history 
is so. Both steps together show that the existence of the lawyer in the 
actual conversion episode cannot have the moral significance that 
our interlocutor had ascribed to it. 

The parallel explosion of our international justice example would 
go like this: many are initially inclined to approve of a regime in 
which each state’s citizens are free to elect whatever leaders they 
prefer and these elected leaders are expected to safeguard and pro- 
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mote the interests of the citizenry and the state, even at quite some 
cost to foreigners. People tend not to be disturbed by existing ex- 
treme international inequalities in bargaining power which engen- 
der, and are reinforced by, radical inequalities in rights and powers, 
in income and wealth, and in access to education and health care. 
Yet they do find such inequalities unjust when they occur within a 
single state. 

To challenge such a conception of justice, we first construct a 
fictional history of the present world order which differs from our 
actual history in that in it the creation of national borders was moti- 
vated by a desire to exploit precisely the moral differential that our 
interlocutors are postulating. About this fictional history we draw 
the same conclusion as we drew about the South Africa case: na- 
tional borders so created lack the alleged moral significance. Eco- 
nomic interaction on the basis of extreme inequality is as unjust 
after their creation as before. 

The second step of the challenge consists in the question how the 
moral evaluation of the present economic relations between the 
First and Third Worlds can depend on the motives of those who 
once transformed colonies into sovereign states. How can the way in 
which the actual history of our global economic arrangements 
differs from their fictional history make a moral difference to the 
claims that today’s rich and poor have upon global economic institu- 
tions? Why should economic relations on the basis of radical inequal- 
ity be any more acceptable in our world now than they would be in 
the hypothetical southern Africa of 2100? Failing convincing replies 
to these questions, we must conclude that terms of cooperation that, 
violating basic justice, would be strictly unjust within any state must 
also be unjust across national borders. 

IX. PUZZLES OF EQUIVALENCE 
I have suggested that we should avoid inherently regrettable ideal 
incentives toward creating or joining social arrangements by deny- 
ing altogether the moral significance of social arrangements in mat- 
ters of common decency and basic justice. A further test of this 
proposal is provided by an argumentative strategy that can challenge 
an approving moral judgment directly—without any reference to 
actual or fictional incentives. If someone condones a course of con- 
duct or pattern of interaction involving social arrangements, we may 
challenge this judgment by constructing a parallel scenario that 
differs from the first only in that it lacks the relevant social arrange- 
ment(s). Such a challenge may show that the judgment in question 
implicitly relies upon the moral significance of a difference that, on 
reflection, turns out to be merely cosmetic. Through this strategy 
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one can challenge the alleged moral significance of differences even 
in cases where it would not give rise to inherently regrettable incen- 
tives, i.e., where it would not be practicable to exploit this sig- 
nificance. 

Our landlord case can be recast so that it centers around the 
moral differential itself, rather than the incentives this differential 
may generate, by telling a story about two landlords, in similar cir- 
cumstances, whose apartment complexes get converted at great cost 
to existing tenants. One of the landlords employs a lawyer, who 
initiates the conversion in the owner’s behalf, while the other land- 
lord is acting on her own. Why should the first conversion (no mat- 
ter how the lawyer was originally hired) be any less wrong than the 
second? 

Let me give two sample challenges in this vein to moral judgments 
that it would be virtually impossible to attack in the fictional history 
mode. Imagine someone who believes that it is ethically permissible, 
if not obligatory, for a lawyer defending a rapist (whose guilt is not 
in doubt) to use any and every legal means likely to reduce his 
client’s sentence. Thus it is perfectly all right for him to try to divert 
the attention of the jury and the general public with various fabrica- 
tions about the victim’s sexual past and preferences which, even if 
true, would not have the slightest legal relevance. This judgment can 
be challenged by constructing a parallel scenario without attorneys, 
in which accuser and defendant present their cases in person. In this 
scenario, a man, after having raped his victim, is now trying to divert 
the attention of the jury and the general public with various fabrica- 
tions about the victim’s sexual past and preferences which, even if 
true, would not have the slightest legal relevance. The challenge is: if 
it is unethical so to slander the victim for the sake of a lighter sen- 
tence in one’s own behalf, then why should it not be unethical to do 
so in behalf of another? What is it about the existence of attorney- 
client teams which supposedly overcomes the ordinary moral claims 
of rape victims against groundless slander for the sake of a lighter 
sentence? 

The second example concerns an aspect of international relations. 
Suppose someone believes that it is permissible for the elected Presi- 
dent of the American people, acting in the best interests of his 
constituency, to bring about the deaths of many unarmed civilians in 
order to reduce dangers and losses to American soldiers. He may, 
for example, order the aerial bombardment of population centers 
so as to demoralize the government and armed forces of the enemy 
and thus to increase the chances for their surrender. This moral 
judgment can be challenged by constructing a parallel scenario 
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without social arrangements, such as the following: operating on 
their own in enemy territory, individual American soldiers cause the 
deaths of many unarmed civilians in order to reduce their own 
dangers and losses. For example, they indiscriminately throw hand 
grenades into civilian homes so as to protect themselves against the 
possibility of enemy snipers hiding inside. 

When comparing these two scenarios, we must assume, of course, 
that other things are, as far as possible, the same. In particular, both 
cases are such that the following facts obtain, and are known to 
obtain by the relevant decision makers: the U.S. involvement in the 
war is justifiable. Victory or defeat, with any cosmic consequences 
either of these might entail, are not at stake. Interspersed among 


the civilians, there is a (small) proportion of enemy soldiers (the ` 


same in both cases), who get killed as well. Both scenarios produce 
the same predictable (and rather high) ratio of their civilians killed 
to our soldiers saved: there is the same unilaterally imposed correla- 
tion of benefits for us and harms deliberately inflicted upon non- 
combatants. The challenge is then to explain why the destruction of 
civilian targets as ordered by the President on behalf of his constitu- 
ents should be ethically permissible while a similar destruction of 
civilian targets by individual soldiers is not. Why should whether the 
civilians of Hiroshima may be sacrificed for the sake of American 
soldiers hinge on whether or not the Americans make this decision 
individually or through an authorized representative? 
X. CONCLUSION 

Even if everything I have said thus far were convincing—a rather 
big ‘if’—it would compel no substantive moral conclusions. There 
is, as we have seen, vulnerability to counterclaims. A defender of the 
moral judgments I have challenged may be able to show that a 
change or difference I have diagnosed as “merely cosmetic” is mor- 
ally relevant after all. She may be able to show, for instance, that her 
morality attributes intrinsic moral value to the creation of certain 
social arrangements, so that it is morally important after all that 
lawyers be hired, states be multiplied, and so forth. 

Even if the defender can be forced to concede that the change or 
difference in question really is merely cosmetic, the revision of her 
morality might still go either way: my challenge may open her eyes to 
wrongs and injustices suffered by those excluded from the more 
powerful social arrangements that tie together the rich and mighty. 
But then it might, instead, also lead her to change her morality in 
ways that would make their suffering acceptable even in the absence 
of any social arrangements whatever. Thus, she might cease to find 
wrong even the landiord’s direct destruction of his tenants’ commu- 
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nity for the sake of extra profits. She might now condone apartheid 
even within a single state. She might come to endorse weaker ethical 
constraints even upon the investment of one’s own money. She 
might now tolerate the deliberate killing of civilians in war even 
when this is done by individual soldiers acting on their own. And she 
might convert to a libertarian conception of economic justice that 
permits radical inequalities even among compatriots. I hope, of 
course, that no one will be attracted to such revisions. But I cannot 
rule them out. The examples show then that the incoherences we 
have diagnosed through the analysis of loopholes can be repaired 
through quite diverse revisions. 

This problem highlights that I have been arguing for a structural 
constraint upon moral conceptions: I am trying to upset received 
moral convictions not by confronting them with a new set of sub- 
stantive moral intuitions, but by trying to show how we sometimes 
structure and organize our moral convictions in a way that tends to 
obstruct and undermine their practical significance. 

The character of such structural defects in a morality is likely to 
bear some relation to the position of those holding it. For people in 
the developed West, for example, it is comfortable to believe that— 
thanks to intervening borders—the moral demands on our relations 
with people in Latin America, say, are much weaker than even the 
minimal moral demands on relations with one’s compatriots. If this 
is right, the present essay may also make a contribution to the expla- 
nation of complacency: How can we live so easily with the enormous 
advantages we enjoy in our political and economic relations with 
Third-world populations? 

My main objective, however, was to sketch a way in which we 
might advance our (ordinary and academic) moral discourse. Even 
while there is a good deal of substantive disagreement, it may still be 
possible to establish some structural constraints that any acceptable 
(conception of) morality must satisfy. Agreement on such con- 
straints would tend to narrow the existing spectrum of moral de- 
bate. It would thereby make a contribution toward moral conver- 
gence, which cannot but enhance the practical significance of moral 
values. 

THOMAS W. POGGE 
Columbia University 
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Kant’s Theory of Freedom. HENRY E. ALLISON. New York: Cam- 
bridge University Press, 1990. xii + 304 p. Cloth $49.50, paper 
$15.95. 


Kant held that human beings possess transcendental freedom as well 
as merely practical freedom. That is, he held not merely that human 
action is capable of being governed by principles as opposed to 
being determined solely by the occurrence of sensory desires and 
impulses, but also that such governance of our action by principles 
is incompatible with the determinism that he took to rule the empiri- 
cal world without exception, so that in any case of action we could 
always have chosen to do otherwise than we did. The two claims are 
not identical: one can hold that it is merely a fact of nature that can 
itself be given a causal explanation that human behavior is at least 
sometimes controlled by adherence to certain principles as opposed 
to the mere presence of certain inclinations. If this is so, then the 
existence of practical freedom does not by itself entail the existence 
of transcendental freedom. It is customarily supposed that Kant 
thought that the possibility of transcendental freedom could be 
established by theoretical considerations alone, but that the reality 
of transcendental freedom could be established only by appeal not 
merely to practical reason in general but specifically to the condi- 
tions of moral imputation, which requires the possibility that human 
beings could always have chosen to do otherwise than they in fact 
did as a necessary condition for the imputation of praise and blame. 

In his new book, Henry Allison rejects this view, arguing that 
Kant thought that the reality of transcendental freedom is a neces- 
sary condition of the possibility of rational agency in general, not 
just moral agency—that is, a condition of the possibility of action 
in accordance with any kind of principle, not just the principle of 
morality—and, what is more, that Kant’s basis for this view of 
agency is essentially theoretical rather than practical. This posi- 
tion accounts for what would otherwise be the counterintuitive 
organization of Allison’s book, namely, that he begins by offering 
the argument for the reality of transcendental freedom and the 
interpretation of transcendental idealism that he holds to be nec- 
essary to make sense of transcendental freedom (in part I) before 
discussing the fundamental principle of morality that would ordi- 
narily be thought to motivate Kant’s belief in transcendental free- 
dom (in part III). (This organization can also be explained by the 


0022-862X/92/8902/99-110 © 1992 The Journal of Philosophy, Inc. 


100 THE JOURNAL OF PHILOSOPHY 


chronological fact that Kant introduces the theory of transcen- 
dental freedom in the Critique of Pure Reason before attempting 
the detailed connection of freedom to the fundamental principle 
of morality in the Groundwork of the Metaphysics of Morals and 
Critique of Practical Reason.) 

Allison’s methodological presuppositions are very different from 
those I would bring to the study of Kant’s various discussions of 
freedom. Allison, in this regard certainly like many other distin- 
guished commentators, is deeply committed to the assumption that 
the views of a great philosopher like Kant must possess a profound 
unity that can be brought out by a sympathetic interpretation. My 
own studies of Kant have persuaded me that Kant’s greatness lies 
more in some of his particular analyses and arguments and in his 
recognition of the complexity of the connections among them than 
in his pretensions to systematicity. In particular, I believe that the 
appearance that the three critiques constitute a single, consistent 
system, an appearance which Kant certainly cultivated, is deeply illu- 
sory; that in fact Kant had no thought of needing a second critique 
when he wrote the first nor of needing a third when he wrote the 
second; and that his views underwent a great deal of evolution and 
revision over the course of these three great but to some degree 
independent works. For this reason, my own interpretations of Kant 
have placed me in the more critical and revisionist camp of com- 
mentators often reviled as “patchwork” theorists. 

But it is my view that Allison’s apparently attractive methodologi- 
cal assumptions nevertheless have damaging consequences for the 
present work Part of Allison’s project is to show how the single 
conception of the transcendental idealism of the Critique of Pure 
Reason which he expounded in his 1983 book Kant’s Transcen- 
dental Idealism: An Interpretation and Defense} provides the 
framework necessary for understanding Kant’s theory of freedom, 
an exercise that is required because this particular interpretation of 
transcendental idealism seems singularly ill-suited to this task. Here 
I believe that Allison has compounded the difficulty of his task by his 
refusal to recognize that the interpretation of transcendental ideal- 
ism which he prefers, although it is certainly suggested by some of 
Kant’s remarks, is by no means the only one for which Kant argues 
and may not be the one best suited for understanding the assump- 
tions that Kant makes when it comes to the metaphysics of moral 
philosophy. But beyond this, Allison’s overarching project of unifica- 


! (New Haven: Yale, 1983), esp. chs. 1 and 2, pp. 3-34 See my critique in Kant 
and the Claims of Knowledge (New York: Cambridge, 1987), ch. 15, pp. 383-44. 
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tion in the present book, which is nothing less than the aim of find- 
ing the foundation of Kant’s moral argument for transcendental 
freedom in a single conception of agency underlying his theoretical 
as well as practical philosophy, puts the value of his work at great 
risk if this underlying conception of agency, whether or not Kant 
gives it the degree of importance that Allison does, is not very per- 
suasive. In my view, it is not, and therefore the value of interpreting 
Kant’s moral theory of freedom in light of an exclusive emphasis on 
this one idea is dubious. 

My main criticism of Allison’s book, therefore, is this.’ The theoret- 
ical proof of the reality of transcendental freedom which he finds in 
the Critique of Pure Reason may well be suggested there. But it sits 
uneasily with Kant’s claim to have shown no more than that ‘‘free- 
dom is at least not incompatible with nature” (A 558/B 586), and is 
in any case entirely inadequately expounded and defended. Further, 
although Kant may not yet have recognized the failure of this ap- 
proach in 1785 when he published the Groundwork, he clearly did 
by 1788 when he argued in the Critique of Practical Reason that 
the argument for the reality of transcendental freedom must be 
strictly practical, not theoretical. Allison’s treatment of the two ma- 
jor ethical treatises in part IIT of his book might appear to recognize 
this fact, for it looks as if he there falls back into the more traditional 
search for an argument for the reality of freedom from moral con- 
siderations alone. But in fact he tries to avoid the traditional ap- 
proach, and instead reinterprets the central argument of the Cri- 
tique of Practical Reason as turning on the conception of transcen- 
‘dental freedom as the necessary condition of any kind of agency 
which he has found in the Critique of Pure Reason. In my view this 
conception is too weak to carry the weight Allison assigns to it, and it 
is precisely because he knew this that Kant did not in fact deploy this 
conception in the second Critique. Allison thus undermines any 
natural reading of Kant’s progress in his moral theory. 

I 


Allison’s interpretation is organized around three theses. The first is 
what he calls, following the lead of Religion within the Limits of 
Reason Alone, Kant’s incorporation thesis, the view that no impulse 
or desire ever motivates human action by the mere psychological 
fact of its occurrence but does so only by the agent's taking it 
up—incorporating it—into a maxim or principle that permits or 
endorses action on that desire (5-6, 39-40). The second thesis, 
which Allison does not name but which following the model of the 
first may be called the spontaneity thesis, is the claim that the act of 
adopting a principle, whether moral or not, as well as that of incor- 
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porating a desire into one’s ends on the basis of such a principle, 
cannot be understood as a deterministic effect of some antecedent 
cause but can only be understood as a manifestation of transcen- 
dental freedom. These two theses constitute the gist of Allison’s 
interpretation of Kant’s “thick” conception of rational agency in 
general. The third thesis, which he entitles the reciprocity thesis 
(201-13), is that the ability to conform one’s behavior to the moral 
law requires transcendental freedom but also that the possession of 
transcendental freedom entails the recognition of the validity of the 
moral law. This thesis, which is certainly asserted in both the 
Groundwork and the Critique of Practical Reason, is deployed by 
Allison in an attempt to prove that Kant thought that our possession 
of rational agency in general, which suffices to prove our transcen- 
dental freedom, also entails our recognition of the validity of the 
moral law in particular. 

There can be no doubt that Kant subscribes to the incorporation 
thesis (as others have also shown), and in part II of his book Allison 
shows its importance for understanding a number of important is- 
sues in Kant’s moral psychology. These include the problem of act- 
ing from the motive of duty, where Allison resolves the dispute over 
whether such action requires the absence or inefficacy of inclina- 
tions by showing that the issue is never the mere presence of inclina- 
tions but whether they have been taken up into a maxim that is 
licensed by the moral law or one that is not (chapter 6); Kant’s 
theory of moral disposition and radical evil (chapters 7 and 8); and 
his response to Friedrich Schiller’s attempt to remove the divide 
between moral sentiment and moral principle (chapter 10), where 
again the issue can never be merely what kinds of sentiments one has 
but how they are incorporated into one’s maxims. But Allison also 
defends the metaphysical claim that the incorporation thesis implies 
the spontaneity thesis, that is, that action on principles and never 
merely per stimulos alone requires transcendental freedom. Here 
his account is unconvincing. i 

The argument that Allison finds in Kant is just this: since rational 
agency requires incorporating or “taking up” into one’s principles 
whatever inclinations may present themselves to one, or else reject- 
ing action on such inclinations because of their conflict with these 
principles, regardless of whether these are moral or merely pruden- 
tial (36), rather than merely react to their presence, such incorpora- 
tion is an act of spontaneity rather than mere receptivity, just as 

? A clear account of it, though without Allison’s title, may be found in Allen W. 


Wood, “Kant’s Compatibilism,” in Self and Nature in Kant’s Philosophy, Allen 
W. Wood, ed. (Ithaca: Cornell, 1984), esp. pp. 78-80. 
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using the faculty of understanding to conceptualize and judge data 
that are passively received by the senses is also an act of spontaneity 
rather than receptivity; and such acts of spontaneity cannot be in- 
cluded within the realm of causally determined events (36—7). Thus, 
such acts must be ascribed to the “intelligible” rather than merely 
“empirical” character of the agent. In Allison’s words, “‘to conceive 
of oneself (or someone else) as a rational agent is to adopt a model 
of deliberative rationality in terms of which choice involves both a 
taking as and a framing or positing. Since these activities, as expres- 
sions of spontaneity, are themselves merely intelligible (they can be 
thought but not experienced), it is necessary to attribute an intelligi- 
ble character to the acting subject” (38). Because of their spontane- 
ity, any acts seen as products of practical freedom (action according 
to a principle) must also be seen as products of transcendental free- 
dom (action free of determinism). 

Kant may well suggest such a position in the first Critique (see A 
546-8/B 574-6). Even so, it is clearly a poor argument. Conceptual- 
ization and judgment may well be “spontaneous” rather than “‘re- 
ceptive” in that they are not products of sensory stimulation alone 
but emerge only from internal cognitive processing of sensory stim- 
uli; but that does not imply that there is anything nonnatural or 
nondeterministic about such cognitive processing. It may just be a 
fact of nature like any other that we process our sensory stimuli the 
way we do, even if no other creatures in nature do. Likewise, al- 
though incorporating a desire into one’s maxims is “active” in a way 
that the mere occurrence of the desire itself is not, it hardly follows 
that this “activity” itself cannot be explained within a naturalistic 
framework. Without a great deal more argument, the contrast be- 
tween ‘“‘spontaneity” and “‘passivity” is only an unredeemed meta- 
phor that cannot support an inference to the real existence or even 
the mere possibility of a breach with determinism. But this contrast 
is all that Allison’s defense of the spontaneity thesis amounts to; he 
does not attempt to prove that mental “activity” is really incompati- 
ble with determinism. 

There are also problems with Allison’s attempt to defend Kant’s 
reconciliation of this account of intelligible character with the deter- 
minism that Kant supposes to prevail in the empirical realm by the 
use of bis well-known “two-aspect” rather than “two-worid” inter- 
pretation of transcendental idealism. On this account, appearances, 
where determinism rules, and things in themselves, where spontane- 
ity is possible, are not to be understood as two different sets of 
objects, but rather just as a single realm of objects represented 
through two different conceptions or points of view: the concept of 
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appearances, which includes those features necessary for us to repre- 
sent objects (“epistemic conditions”), and the concept of things in 
themselves, which excludes them. Because Kant’s argument for uni- 
versal causation shows it to be the necessary condition for time de- 
termination, but time is only an epistemic condition and therefore a 
feature of our conception of appearances but not a feature of things 
in themselves, there is supposed to be “conceptual space” for the 
conception of ourselves as things in themselves who possess tran- 
scendental freedom (44). No disturbing metaphysics but only the 
comfortable conceptual distinction between two points of view is 
therefore necessary to justify Kant’s assertion of the compatibility of 
empirical determinism and transcendental fteedom. 

Kant doubtless suggests such an interpretation of his transcen- 
dental idealism at various points in the first Critique (e.g., B 307), 
but there are also places where his arguments actually imply the 
stronger view, not merely that our concepts of things in themselves 
exclude space and time but that the things themselves are not spatial 
and temporal (e.g., A 26/B 42). It would seem to be the latter rather 
than former interpretation that is necessary to make sense of his 
claims about noumenal freedom. In any case, Allison’s defense of 
his interpretation as the basis for a defense of Kant’s theory of 
transcendental freedom does not work. 

First, when he defends Kant’s ascription of freedom rather than 
determinism to our “intelligible character” by saying that intelligible 
character is only thought but not experienced (see also 52), Allison 
ignores one of the historically most significant aspects of Kant’s 
treatment of causation: Kant’s reply to Hume’s doubts about the 
meaningfulness of our idea of causation (as opposed to those about 
the justifiability of induction) depends precisely on his recognition 
that the idea of necessary connection is not to be sought among our 
impressions, but is rather a category brought to our impressions and 
applied to them through a schema. In other words, causation and 
thus determinism are also thought and not merely experienced— 
looking for causation in mere experience was what led Hume to 
despair.” Thus one can hardly show that transcendental freedom 
and determinism are compatible merely by distinguishing between 
thought and (mere) experience. 

The problems with Allison’s version of Kant’s compatibilism go 
deeper than this, however. On the face of it, merely distinguishing 


? Or so Kant thought. Actually, Hume himself was satisfied with his account of 
the origin of the idea of connection in an internal sensory impression in 
Treatise, bk. I, pt. II, and only fell into despair when be could noe find a sensory 
impression to serve as the origin of the idea of distinct existence in pt. IV. 
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between a point of view on objects that includes determinism or the 
ground for it and one that excludes it cannot open up “conceptual 
space” for transcendental freedom, because merely abstracting 
from a particular property of an object in one of our concepts of it 
does not mean that the object does not in fact have that property. As 
Terence Irwin‘ has written: ‘Now if an event is determined, it is true 
of it under all descriptions that it is determined, even though only 
some true descriptions . . . show why it is determined” (ibid., p. 38). 
This is the most obvious objection to the use of the “two-aspect”” 
interpretation of transcendental idealism for a theory of freedom. 
Allison tries to defend himself against Irwin’s formulation of the 
objection by arguing that it “rests on the assumption that being 
causally determined is a necessary OT essential property of any occur- 
rence of which it is predicated,” and then arguing that for Kant ‘‘the 
concept of causality is merely an epistemic condition, more specifi- 
cally, a condition of the representation of objective temporal 
change” (44), which he presumably therefore considers to be merely 
an accidental or inessential property of objects. But this either 
misses the point or else in fact gives up the two-aspect interpretation 
of transcendental idealism. It misses the point if it supposes that 
Irwin’s objection depends on the assumption that determinism is 
necessarily true of human agents in all possible worlds—Irwin’s 
point is simply that, if determinism is in fact true at all of humans as 
they actually are, then it remains true of them no matter how they 
are described, for leaving determinism out of some description does 
not make it false. Adopting Allison’s apparent alternative of viewing 
our behavior as only “accidentally” rather than “essentially” deter- 
ministic does not make it nondeterministic. (Maybe it makes the 
behavior of our “counterparts” in some other “possible world” 
nondeterministic—but what would that prove about us?) Alterna- 
tively, if Allison is responding that deterministic causality is only a 
feature of our representation of ourselves in our empirical 
surroundings but not a feature of ourselves as we really are, because 
“epistemic conditions” are necessarily merely subjective and there- 
fore not true of things in themselves, then what becomes of the 
difference between the “‘two-aspect” and ‘“‘two-world” interpreta- 
tions of transcendental idealism? Allison is forced to maintain that 
determinism must be true of our representations of ourselves but 
false or at least possibly false of ourselves as we really are, where our 
representations and ourselves cannot be the same objects—which is 


4 “Morality and Personality: Kant and Green,” in Self and Nature in Kant’s 
Philosophy. 
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all that the “two-world” interpretation has ever maintained. So Alli- 
son Cannot maintain his interpretation of transcendental idealism 
yet answer Irwin’s objection. 

Finally, Allison’s argument that his interpretation of the incorpo- 
ration thesis and intelligible character frees us from traditional 
worries about the necessity of acts of choice that must be timeless 
because they are acts of noumenal rather than phenomenal agents is 
equally inadequate. His position is that because incentives “do not 
motivate by themselves causing action but rather by being taken as 
reasons and incorporated into maxims” in accord with principles 
supplied by reason, where reason “‘is not literally an efficient cause” 
(51) and the “act of incorporation,” “as an expression of spontane- 
ity, cannot be regarded as the causal consequence of antecedent 
conditions” (52), we do not have to think of the incorporation of 
desires into maxims in temporal terms at all, thus as either in time or 
timeless—we are simply in the space of reasons, not of temporal 
events. But this is clearly false. While the fact that adherence to a 
certain principle justifies or excludes action on a certain incentive 
may not itself be a temporal fact, the fact that a particular agent 
holds a certain principle and has justified or excluded action on a 
given incentive on the basis of holding this principle is just as tem- 
poral a fact as the fact that a certain desire occurs in the agent. 
Holding a principle may be a disposition rather than an event, but it 
is still temporal, as is obvious from the fact that one can start or 
cease to hold a principle, as in deciding to adopt it or surrender it. 
And of course, applying or deciding to apply a certain principle to 
a certain desire may be as much of a temporal event as the occur- 
rence of the desire itself. Even if it is not done self-consciously and 
its temporal location cannot be pin-pointed, it is still just as much a 
temporal event as the occurrence of the desire, which may not be a 
conscious and precisely datable event either. So holding that as 
rational agents we are never moved to action merely by the mere 
occurrence of incentives but rather by the incorporation of such 
incentives into our maxims or principles hardly removes our rational 
agency from the sphere of temporal discourse by itself. We must 
either accept this result, whatever its implications for freedom, or 
defend the unpleasant doctrine of timeless events of noumenal 
choice. 

I conclude, then, that although Allison may be right that Kant 
attempted to derive transcendental freedom from the spontaneity of 
thought in general in the Critique of Pure Reason, this attempt was 
an utter failure; and Allison’s own attempt to defend Kant’s account 
of transcendental freedom with his “two-aspect” interpretation of 
transcendental idealism does not fare much better. Since Allison’s 
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interpretation of Kant’s ultimate derivation of transcendental free- 
dom from moral considerations in part [II of his book rests on this 
theoretical argument offered in part I, most of that discussion also 
collapses with the failure of the latter. 
0 

Part II discusses the connections between transcendental freedom 
and the fundamental principle of morality. Allison might seem to 
fall back here into the traditional interpretation that Kant intends 
the actuality of freedom to be derived from purely moral consider- 
ations, but the purpose of his discussion is in fact to show that 
Kant’s supposedly moral argument for freedom depends not on 
traditional worries about imputation but on the spontaneity thesis. 

Allison’s discussion is structured by the customary assumption 
that Kant first intends to establish an analytical biconditional be- 
tween freedom and the moral law, and then a further synthetic prop- 
osition, either, as in the Groundwork, that we are indeed free and 
therefore bound by the moral law, or, as in the Critique of Practical 
Reason, that we are indeed bound by the moral law and therefore 
truly free. In either case, the argument begins with what Allison calls 
the reciprocity thesis, that is, Kant’s claim that only a free will could 
be capable of acting in accordance with a purely formal principle 
such as the categorical imperative but also that a free will could only 
act in accordance with such a formal principle (see 202, where Alli- 
son cites Groundwork 4:446—7 and Practical Reason $85 and 6). 

Allison treats the first conjunct of the reciprocity thesis, the claim 
that conformity to a formal principle requires a free will, as non- 
problematic, although that is not obviously the case if the freedom 
of the will is taken to be transcendental freedom. Conformity to a 
formal law, at least under some circumstances, could be explained 
by mere practical freedom, that is, the possibility of the naturalistic 
determination of action by a principle rather than mere inclination, 
unless one either assumes that the only possible empirical determi- 
nants of action are mere inclinations—a view that Allen Wood 
ascribes to Kant and Allison unpersuasively rejects (48—51)—in 
which case practical freedom is an empirical impossibility, or else 
argues that practical freedom entails transcendental freedom, which 
Allison has done in part I, with what success we have already seen. 
In any case, it is the second conjunct of the reciprocity thesis, the 
claim that possession of freedom (again assumed to be transcen- 
dental freedom) entails the validity of the moral law, which draws 
Allison’s attention. 

The most obvious problem with this proposition is that, if the 
moral law is actually taken to describe the operative causality of a 
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free will, then an agent who fails to conform to the moral law must 
lack a free will and thus not be responsible for his moral failure.’ 
The equally natural response to this problem is to say that Kant only 
means to argue that freedom of the will entails a capacity to recog- 
nize the validity of the moral law; but then this does not seem strong 
enough for his purposes, at least in the Groundwork, where he 
apparently wants to prove to us that we are bound by the moral law, 
not just capable of recognizing its validity, in virtue of our freedom. 
So Allison tries to steer a middle course, and interprets Kant’s thesis 
as equivalent to the claim that insofar as we are transcendentally 
free rational agents we necessarily recognize the validity of ‘‘criteria 
of reasonableness” (204), or require of our maxims “legitimacy for 
all rational beings in similar circumstances,” from which it would 
follow that “rational agents cannot reject the universalizability test 
without, at the same time, denying their rationality” (205). This 
might seem open to the objection that a rational agent, in particular 
a rational egoist, could recognize that all others “ought” to act on 
his own maxims without in fact having reason to will that they do so 
(206) (the rational egoist of course hopes that others will be 
irrational), but, Allison claims, this objection depends on a failure to 
recognize the depth of Kant’s conception of rational agency. He 
then appeals to his interpretation of Kant’s “thick” conception of 
rational agency to claim that a rational agent is never moved by 
impulse alone, but always incorporates his impulses into his maxims 
in virtue of a principle, and is thus, apparently, necessarily commit- 
ted to the recognition of a requirement of justifiability for his princi- 
ples (208). Here, however, his argument is obscure: it is just not 
obvious why the fact that a rational agent must always incorporate 
his desires into some principle entails that the latter must be a uni- 
versally justifiable principle. Allison is presumably not assuming 
that a transcendentally free agent is just a pure reason and does not 
actually have any incentives that could conflict with universally justi- 
fiable principles, which would depend on a metaphysical conception 
of the purely rational subject and lay Allison open to the standard 
Hegelian objections to that. But he does not offer any other explana- 
tion of why the necessity of acting on principles implies the necessity 
of acting on a universally justifiable principle. 

Allison’s defense of the reciprocity thesis is thus unconvincing. 
His criticism of Kant’s attempt to exploit the thesis in the Ground- 
work by providing a speculative proof that we have a transcenden- 
tally free will, which would therefore imply that we are bound by the 


* This objection of course goes back at least as far as Henry Sidgwick, al 
Allison associates it particularly with Robert Paul Wolff (95, 263 n. 35). 
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moral law, turning upon the alleged theoretical fact of our member- 
ship in the intelligible world, is basically sound (although it does 
follow in tracks already well-laid by commentators such as Dieter 
Henrich and Karl Ameriks). But Allison insists on interpreting the 
argument from the moral law to freedom with which the Critique of 
Practical Reason replaced the Groundwork’s argument from free- 
dom to the moral law in light of his spontaneity as well as incorpora- 
tion thesis, and here his story becomes problematic again. 

Allison does refer in passing to Kant’s assumption that moral im- 
putation requires freedom (e.g. 241), but on his account this is not 
the basis of Kant’s argument. Instead, he proceeds thus. First, he 
interprets the conception of the ‘‘fact of reason” that Kant intro- 
duces into the second Critique as “the consciousness of standing 
under the moral law and the recognition of this law by every natural 
buman reason as the supreme law of its will” (233). This seems 
correct. He then holds that this “very consciousness of the moral 
law as binding produces an interest” (239), but this leaves the prob- 
lem of showing “‘that we also have the capacity to fulfil [the] require- 
ments” of such an interest (240). On the face of it, that is, it might 
seem that we could have some disposition in behalf of the moral Jaw 
but not sufficient freedom or strength of will to comply with it. To 
avoid this problem, Allison again has Kant appeal to his incorpora- 
tion thesis as well as his spontaneity thesis: an interest, he says, like a 
maxim but opposed to a mere inclination, is necessarily a product of 
an act of practical reason, thus it is necessarily a product of sponta- 
neity, which in turn entails not only the positive freedom of con- 
ducting our behavior according to a law but also the negative free- 
dom of being free from all determinism (241). Such negative free- 
dom is apparently sufficient to ensure that we always have the 
freedom to resist whatever blandishments might stand in the way of 
our adherence to the moral law which we recognize in the fact of 
reason. Thus the mere fact of our interest in the moral law proves 
that we have the freedom to comply with it. 

This argument rests on an obscure notion of interest and the 
unexplained assumption that we could have clear evidence that we 
have such an interest in morality as opposed to a mere inclination in 
its behalf, which would not of course trigger Allison’s spontaneity 
thesis. But in any case I do not find any evidence that Kant suggests 
such an account of interest in the second Critique. It seems to me 
clear that Kant’s argument for the actuality of the freedom of the 
will rests solely on his belief that it is a necessary condition for the 
imputation of praise and blame: “the fact of reason” makes it mani- 
fest to each of us that we always ought to conform our behavior to 
the moral law, even when threatened with the gallows for so doing, 
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and we judge, “therefore, that one can do something because he 
knows that one ought, and one recognizes that he is free—a fact 
which, without the moral law, would have remained unknown to 
him” (5: 30). There is no suggestion here of any “interest” in moral- 
ity intervening between our recognition of our obligation under the 
moral law and our realization of our freedom to live up to it, or any 
hint that the latter depends upon a theoretical explanation of the 
nature of an interest. Kant just assumes that moral imputation—pre- 
sumably not only blame for vicious deeds but also praise for virtuous 
ones—would be inappropriate if the subject of imputation could 
not in fact have chosen to do otherwise than he did. This assumption 
of course needs examination, but Allison does not supply it. 

In the end, therefore, I do not find Allison’s account of Kant’s 
theory of freedom enlightening or compelling. It begins with a dubi- 
ous theoretical proof of freedom, appears to return to the more 
customary moral proof of freedom, but then obscurely and unsatis- 
factorily reinterprets that moral proof by means of the original theo- 
retical proof. In all of his work on Kant, Allison has adopted the 
stance of the sympathetic defender of Kant rather than the critical 
revisionist. But in the present book, it seems to me, he plays the role 
of revisionist himself, though unsuccessfully, by pinning his case for 
the underlying continuity of Kant’s thought on a shaky argument 
from the first Critique which there is little evidence to suggest that 
Kant retained in the second. This is no advance in the interpretation 
of Kant’s moral philosophy. We still await careful scrutiny of what 
was clearly the fundamental assumption of Kant’s final, practical 
proof of freedom, the assumption that moral imputation presup- 
poses transcendental freedom of the will, as well an answer to the 
equally fundamental question of whether Kant’s normative thesis of 
the unconditional value of human autonomy requires a metaphysical 
thesis of the freedom of the will at all. 
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NOMINALISM AND THE CONTINGENCY 
OF ABSTRACT OBJECTS* 


RADITIONAL mathematical nominalism! regards a mathe- 

matical theory as acceptable only if it can be so interpreted 

as to involve no purported quantification over or singular 
reference to abstract objects. To whatever extent a theory fails this 
test, it is viewed as literally unintelligible. Hartry Field’s? nominalism 
is atraditional in this respect. Field disbelieves in abstract objects, of 
course, and accepts that it is a constraint upon defensible mathemat- 
ical thought that it avoid commitment to them. But he does not hold 
it to be a condition on the intelligibility of a mathematical theory 
that it allow of a non-Platonist semantics. On the contrary, he is 
prepared to grant that a Platonist semantics for, e.g., number theory 
is descriptively correct: that number-theoretic statements do indeed 
purport singular reference within and quantification over a specific 
domain of abstract objects. Since, in his view, there are no such 
objects, Field has to hold that such statements’ truth conditions are 
systematically unrealized. And so, of course, he does. 

How can anyone who holds this view—that math is massively false 
—hope to save the discipline without semantic reconstruction? By 
accepting a distinction, roughly parallel to that advocated by Bas van 
Fraassen? in the philosophy of science, between endorsing a theory 


* Thanks to the ia at i Oxford, and the 1991 
Frege conference at Munich, organized Matthias at which versions of 
this Material were presented. Special to Hartry Field for much interesting 


1 See, e.g., W. V. Quine and Nelson Goodman’s classic nominalist manifesto, 
“Steps towards a Constructive Nominalism,” Journal of Symbolic Logic (1947): 
105-22. 

? As expounded in his Science without Numbers (New York: Black- 
well, 1980). 

> The Scientific Image (New York: Oxford, 1980). 
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and accepting it as true. For van Fraassen, one need not, in endors- 
ing a physical theory, be committed to more than its empirical ade- 
quacy—its correctness as far as what is observable is concerned. 
And for Field, one need not, in endorsing a mathematical theory, be 
committed to more than its conservativeness: where a theory is, 
roughly, conservative with respect to a discourse just in case any 
inferences among statements of the discourse which it mediates 
could at least in principle validly be constructed—though perhaps at 
great cost in length and notational simplicity—without appeal to 
that theory.* For Field, a mathematical theory can be acceptable, 
even if its ingredient statements have Platonistic truth conditions, 
provided it is in this way conservative with respect to nominalistic 
discourse: discourse all of whose ingredient statements have truth 
conditions that make ontological demands only within the domain of 
the acceptably concrete. Field’s program accordingly has the objec- 
tive of showing that as much as possible of classical mathematics can 
be saved as acceptable by this standard. 

A question arises, however, about the interpretation of the idea of 
conservativeness. No mathematical theory that employs a material 
conditional can be conservative unless consistent. But what under- 
standing is one who disbelieves in abstract objects to have of the 
notion of consistency? Orthodox proof-theoretic and semantic ac- 
counts each involve quantification over nominalistically forbidden 
entities—sequences of sentences in the case of proof-theoretic ac- 
counts, and models in the case of semantic ones. Field ought, there- 
fore, to forgo both of these approaches. And this is precisely what 
he does, construing consistency and, associatedly, the notion of con- 
sequence utilized in the concept of conservativeness, in primitively 
modal terms. Roughly: a statement is a consequence of others if and 
only if it is not possible, in a primitively modal sense, for it to be 
false while they are all true, and is inconsistent with others if and 
only if its negation is a consequence of them. 

These proposals have a simple corollary. A consistent theory will 
be one whose axioms are possibly collectively true, where this possi- 
bility is to be understood in Field’s primitive sense.” And a believer 


“ In Field’s own formulation: a mathematical theory S is conservative if, for any 
nominalistic assertion A and any body of such assertions N, A is not a conse- 
quence of N + Suaa E a ci eh ae al 
and Modality (New York: Blackwell, 1989), p. 125. 

* Reflect that a set of sentences—in , a mathematical theory—is con- 
Gach prided al coh oF ren Ss Cait wah Ge a 
{S,,..., Sa}. So i£ {S,S,,..., S,} is a consistent theory, then, following through 
on the foregoing construals, not-S will not be a co: of {S,,...,5,}, and 
hence it will be possible for S}, ..., Sa and not-not-S, = S, to be simultaneously 
true. 
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in the conservativeness of a particular mathematical theory is com- 
mitted, on Field’s account, to endorsing this possibility. 

The objection that is to occupy us arises at this point from the 
consideration that Field’s primitive operator of possibility is object- 
linguistic. It has to be, of course. The program would lose any 
advantage over more traditional versions of nominalism were it to 
construe claims about consistency metalinguistically. For if consis- 
tency is so construed, a nominalist who believes in the widespread 
conservativeness, and hence consistency, of mathematics is commit- 
ted to a belief in nothing other than the feasibility of the traditional 
nominalist program—of providing concrete (or otherwise nom- 
inalistically acceptable) models of mathematical theories. And that is 
just what Field wanted to distance bimself from. But when the opera- 
tor is object-linguistic, Field’s belief in the conservativeness of, e.g., 
number theory would appear to commit him to the view that the 
Peano axioms, say, might have been true as standardly interpreted, 
and hence that the nonexistence of numbers is a mere contingency. 
As a nominalist, he disbelieves in the existence of abstract objects. 
But number theory, he holds, is conservative, so consistent, so—if 
the reasoning just offered is sound—what its axioms collectively say 
might have been true. And if that had been so, there would have 
been numbers. 

This may seem odd. But why is it an objection—why should the 
contingency of numbers and other abstract mathematical objects be 
awkward for Field? For at least two reasons. First, granted that stan- 
dard mathematical theories are conservative, and that their deploy- 
ment in the formulation of, e.g., physical theories can be avoided in 
the manner Field envisages, there is a question about the possibility 
of anyone having grounds for nominalism in the first place. If 
whether numbers exist is contingent, the question is presumably 
beyond a priori resolution. An appraisal will therefore demand em- 
pirical evidence. Field agrees with the present authors, however, in 
rejecting any idea of a direct, intuitional or “perceptual” epistemol- 
ogy of abstract objects. And the very conservativeness of number 
theory, along with its dispensability in formulating scientific the- 
ories, would have the effect that there can be no nominalistically 


lines, for example, rss Meter ai Ta 
ey “What is Cantor’s Continuum lem?” in Paul Benacerraf & Hilary 
Putnam, eds., Philosophy of Mathematics: Selected Readings (New York: Cam- 


bridge, 1983, pee co Pe 483—4); or ep Penelope Maddy, 
and Mathematical Intuition,” Philosophical Review, Lxxxm (1980): 
ee a EER to these views 


here. For some of them, see Wright, a ae Be,” Revue 
Inernationale de Philosophie, cLxvu (19 A 125.79, ex esp. pp. 464-5; and Hale, 
Abstract Objects (New York: Blackwell, 19: 
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statable, indirect evidence for or against the existence of numbers. 
Whether or not the alleged contingency is realized would seem, 
therefore, to be in principle beyond evidence of any sort, and conse- 
quently utterly imponderable. So why is Field a nominalist? Why 
does he presume to a view on the matter? 

In his recent book, reacting to this thought, Field writes as fol- 
lows: “‘if the dispensability programme can be carried out’’—that is, 
if it can be shown not merely that mathematics is conservative but 
that the hypothesis of mathematical objects is dispensable without 
loss in explanations, in descriptions of our observations, in accounts 
of metalogic, and so on: 


that gives us reason to not literally believe mathematics but only to 
adopt a fictionalist attitude towards it. Admittedly, we can’t have direct 
evidence against mathematical entities. We also can’t have direct evi- 
dence against the hypothesis that there are little green people living 
inside electrons and that are in principle undiscoverable by human 
beings; but it seems to me undue epistemological caution to maintain 
agnosticism rather than flat out disbelief about such an idle hypothesis. 
I think that platonism has seemed a plausible position because it has 
been assumed that the existence of mathematical entities is not an idle 
hypothesis. But if . . . the hypothesis is dispensable without loss . . . it is 
natural to go beyond agnosticism and assert that mathematical entities 
do not exist (Realism, Mathematics and Modality, pp. 44-5). 


One observation about this response would be that the general form 
of transition it makes, from recognition of the necessary absence of 
all evidence germane to a hypothesis to a “natural” belief in its 
negation, is arguably completely unwarranted unless there is inde- 
pendent reason to think that there would be germane evidence if 
the hypothesis were true. That is precisely not the situation in the 
relevant case. But matters are worse than that. For idle hypotheses, 
in the sense Field means, have idle negations, too. So “flat out dis- 
belief’’—belief in the negation—of what is idle cannot in general 
be justified merely because it is idle. Agnosticism does not here 
reflect “undue epistemological caution”; it is demanded by mere 
consistency.” 

There is a second and deeper reason, however, why the contin- 
gency of abstract mathematical objects might be thought an objec- 
tionable outcome. This is the reflection which the contingency in 
question would seem to be, metaphysically, absolutely surd—that 


there is no prospect of any account of what the alleged contingency 


7 Or if there is a principle that sanctions disbelief in some such cases, it must 
involve reference to something more than idleness—and Field does not identify, 
still less support any such principle. 
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could be contingent on. It is deeply rooted in our ordinary concep- 
tion of contingency that there ought in general to be explanations of 
why things which might have been the case are not the case, and of 
why things are the case which might not have been the case. But in 
the present kind of instance—the existence of the integers, or real 
numbers, for example—there is not even a glimmering of how such 
an explanation might proceed. This seems a strong sign that the 
notion of contingency would be misapplied in the present context, 
and hence that there is philosophical error in any assumptions that 
commit Field to so applying it. 

Field has responded tersely to this criticism, charging that the idea 
that he suffers any such commitment depends on an equivocation. 
He writes 


Hale and Wright both note that I regard mathematics, and the exis- 
tence of mathematical entities, as consistent . . . and that I take consis- 
tency as a primitive modal notion, a sort of possibility. They then argue 
that since I regard it as false that there are mathematical entities, I 
must hold the existence of such entities to be ‘contingently false’; and 
they both proceed to interpret ‘contingently’ in some non-logical sense 
(Le. some sense other than ‘neither logically true nor logically contra- 
dictory’), to make the position seem absurd. . . . It should be noted that 
if [their] objection were good, it would apply equally well against any 
platonist who is not a logicist: that is, any platonist who agreed that 
mathematics goes beyond mere logic, and hence that the denial of 
mathematics is logically consistent and hence ‘contingent’. But 
of course it is not good, for as I’ve said it turns on an equivocation 
on the meaning of ‘possible’ (Realism, Mathematics and Modality, 
pp. 48-5).® 


Field’s response, then, is that the sense of ‘possible’ for which he 
acknowledges commitment to the possible truth of the Peano ax- 
ioms is not one that subserves the objection, which is consequently 
defused. 

But how exactly? There are two—and, so far as we can see, only 
two—possible lines of thought that would be relevant. 


ri claim that “if [our] objection Le., the Pat a falsehood objection] 


truth of ia SAE NEE E T TEE AEE tees 
sary. There is thus clear space for a version of Platonism, well short of 
logicim as Field conceives it, that is not prey to the contingent objec- 
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(A) It might be maintained (i) that Field’s preferred “austere” notion 
of logical contingency is the only one available; and (ii) that the objec- 
tion needs a richer but dubious notion of contingency. 

(B) It might be granted that there are available other, less austere but 
still respectable notions of contingency, but then maintained that while 
the nonexistence of numbers and other mathematical entities is contin- 
gent in the original austere sense, it is not a contingency—a fortiori 
not a metaphysically surd contingency—in the richer sort of sense de- 
manded by the objection. 


The considerations to follow will depend on no assumption about 
which, if either, of these lines might be Field’s own preference.’ Our 
interest is rather in the moves available in principle to the nominal- 
ism Field has advocated, irrespective of other aspects of its author’s 
actual views. We shall argue that both strategies, (A) and (B), are 
fraught with difficulty. Since Field’s claim—that the original objec- 
tion is defused by reflecting that it is only to the “logical” contin- 
gency of, e.g., numbers that his belief in the conservativeness of 
number theory commits him—requires that one or the other strat- 
egy be defensible, it should be rejected as ill-considered. 

The program is as follows. The next section will argue that claims 
A(i) and Ai) are highly implausible at best. Section II will briefly 
explore certain problems with strategy B. This will lead naturally 
into a discussion, in section III, of certain internal tensions of 


. Field’s views about the semantics of mathematical theories with the 
^ sorts of consideration by which he seemingly regards Platonism as 


scala 
ven if the original objection is not to be dismissed as relying on 
an’ ocation, the nominalist still, of course, has the option of 


biting the bullet: of allowing that the existence of numbers and 
‘other abstract objects is indeed contingent in a richer sense than 


Field’s austerely logical one and of attempting to maintain that the 
resulting position is unexceptionable. Section IV will attempt to 
deepen and refine the original objection in various respects. In par- 
ticular, we shall begin to explore whether the nominalist has any 
prospect of a satisfactory head-on response to it via play with the 
idea of brute contingency. 
L STRATEGY A 

Suppose that there were no other relevant notion of contingency 
than Field’s austerely logical notion—according to which, approxi- 
mately, a statement is contingent just in case it is neither (first- 


° Actually, Field has made it te clear in conversations and correspondence 
that he would want nothing to do with the former. 
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order) logically true nor (first-order) logically contradictory.!° How 
might someone proceed to argue—as per A(@i)—that the contin- 
gency objection would then be stifled? 

One idea might be that, if we introduce ‘it is logically contingent 
that’ as a sentential operator, simply and solely as a definitional 
abbreviation for ‘it is neither first-order logically true nor first-order 
logically contradictory that’, we have—so far, anyway—done noth- 
ing to provide a sense for statements of the form ‘the fact that . . . is 
contingent upon the fact(s) that . . .’, and so we cannot so much as 
intelligibly formulate the question, “What is the (allegedly) contin- 
gent fact (that there are no numbers) contingent upon?” Nothing in 
the way in which we would have introduced the contingency opera- 
tor would prepare for the idea that when it is (merely) contingent 
that the things are thus and so, there is—still less has to be—some- 
thing else upon which their being thus and so depends. 

This, however, has no force. Asking what the putative contin- 
gency, that there are no numbers, is contingent on is a convenient 
and natural way to raise the kind of question that fuels the objec- 
tion. But we can just as well ask why there are no numbers,’ given 
that it is, in Field’s austere sense, contingent that there are none. 
That is a perfectly well-formed and legitimate request for an expla- 
nation, for which no further preparation, supplementary to Field’s 
account of contingency, is needed. According to Field, it is logically 
contingent but false that there are numbers. Well, there are many 
statements that are logically contingent but false, and it is typically 


10 Yn fact, this oversimplifies Field’s notion. Field’s explanation (Realism, Math- 
ematics and Modality, pp. 32-8) of “LTrue” (read ‘it is logically true that’), in 
terms of the laws governing it, makes it clear that many statements involving this 
operator would rank as (austerely) logically true, and that various statements in- 
volving second-order quantification would be among them. It is thus strictly incor- 


rect to that a statement is logically contingent for Field if it is neither 
(first-order) j true not (first-order) logically contradictory. The fact re- 
mains that (austere) ical truth, conti and i are, for Field, 


effect does. The passage, which Field actually quotes (cf. Realism, Mathematics 

and Modality, p. 43), runs: “Field has no Label es of an account of what the 

contingency is contingent on. The world does not, in Field’s view, but 

ight have contained numbers. But there is no explanation of why it contains no 

numbers; and if it had contained numbers, there would have been no explanation 

of that either. There are no conditions favorable for the emergence of numbers, 
and no conditions which prevent their emergence” (Wright, op. cit., p. 465). 
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perfectly proper to ask why these statements are false. So a special 
consideration would be needed—and none is so far in sight—to 
justify rejecting the question as improper in the present case. 

Admittedly, there is a gap between its being “typically perfectly 
proper” to ask why an austerely contingent matter is as it is and 
there being, on the other, an internal conceptual connection ensur- 
ing the availability of at least an appropriate category of explanation 
for each austerely contingent matter. We make no claim to have 
closed this gap, and it is thus theoretically open to a supporter of 
Field to attempt to exploit it—more about that later. The fact re- 
mains that asking for an explanation of why an austerely logical 
contingency is unrealized is, at least in a wide class of cases, perfectly 
legitimate. If there are other austere contingencies where the de- 
mand for explanation is out of order, there ought to be some state- 
able distinction between the two sorts of cases. And argument will 
be needed that the existence of numbers belongs in the latter class. 
There is no progress otherwise. 

It is in any case very implausible that, in line with A(j), a restriction 
to austerely logical contingency could defensibly be maintained. Any 
statement is austerely logically contingent whose most explicit first- 
order formalization is neither a theorem of first-order logic nor 
refutable therein.!? So, for instance, both 


G) There are vixens weighing in excess of eighty pounds. 
and 


(ii) There are male vixens. 


rank as austerely logical contingencies. More generally, austerely 
logical contingency will embrace many statements whose truth it is 
attractive to regard as, in old-fashioned terminology, conceptually 
impossible but a first-order demonstration of whose impossibility, if 
one could be given at all, would require supplementary content- 
explicating postulates of a nonlogical sort. Now, in each of the kinds 
of example typified by (i) and (ii), respectively, the request for an 
explanation of why the statement in question is false is apparently 
perfectly intelligible, and it is easy to say how at least the beginnings 
of an answer should proceed. Given that first-order logic does not 
preclude the truth of such a statement, the question is, what does? 
And the answers are, roughly, the typical genetic make-up and feed- 
ing habits of vixens, and the nature of the concept of a vixen, respec- 
tively. 


™ Bat see note 9. 
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So much is merely common sense. But the question now is: How 
should a proponent of AG) respond to the demand for an explana- 
tion of why there are no male vixens? How to respond without 
betraying an understanding of a broader notion of contingency than 
the austerely logical variety—a notion of contingency that, if num- 
bers are so contingent, will subserve the original objection, even 
if, contrary to what was just suggested, austere logical contingency 
will not? 

Well, reflect on the possibilities. In effect, the question—“‘Why 
are there no male vixens?’’—sets a trilemma: 


Q) There is the option of rejecting the demand for an explanation as 


illegitimate. 

(i) There is the option of attempting an explanation similar to that of 
why there are no vixens weighing in excess of eighty pounds. 
(ii) There is the option of attempting an explanation that preserves a 

distinction between the two kinds of example. 


Option (i) is surely hopeless. It simply is not credible to suppose that 
there is nothing at all to be said about why there are no male vix- 
ens.!® But it would be equally incredible to suppose that a proper 
explanation might assimilate the two examples, attributing the non- 
occurrence of male vixens to causal factors, specifically, to the ge- 
netic make-up and typical environmental conditions of foxes. Some- 
one who thought that a satisfactory explanation might proceed 
along those lines would owe a story about what kind of variation in 
those factors might secure the emergence of male vixens. But there 
is no such story to tell. 

The fact is that only option (ii) is a credible response. More specif- 
ically, however indigestible the thought and unpalatable its terminol- 
ogy may be to philosophers weaned on the thin nutrient of W. V. 
Quine’s “Two Dogmas of Empiricism,”’* the only credible response 
is to acknowledge that it is not in the nature of the concepts male 
and vixen to allow coinstantiation. There may or may not be more 
to say in response to the request for explanation in this particular 
case—but that much must be at least the basis of any satisfactory 
response. 

As soon as it is granted, however, that there is work to be done by 
notions of conceptual possibility and impossibility which apply dif- 
ferentially within the class of austerely logical contingencies, house- 
room has been given to just the kind of richer notion of contingency 


13 A parent who so responded to her child's question would surely be failing to 
i ing that a responsible parent t to want to i 
“In From a Logical Point of View (Cambri : Harvard, 1980, rev. ed.). 
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that it was essential to strategy A to repudiate. Say that a statement is 
strongly contingent, just in case it is both austerely logically contin- 
gent and not conceptually impossible: no impediment to its truth is 
generated either by first-order logic or by anything internal to the 
character of the nonlogical concepts that it involves. If this notion is 
granted and, with it, the resources for a distinction, among austerely 
logical contingencies, between those like “There are vixens weighing 
in excess of eighty pounds” which are also strongly contingent and 
those like “There are male vixens” which are not, the objection 
immediately resurges in the form of a dilemma, namely, to which 
camp should “There are natural numbers” be assigned? 

If to the latter—if the existence of numbers is regarded not as a 
strong contingency but as a conceptual impossibility—then that is to 
abandon strategy A in favor of strategy B. But if to the former—if 
the existence of numbers is conceived as a strong contingency—that 
immediately invites the assimilation of ‘“There are natural numbers” 
to “There are vixens weighing in excess of eighty pounds” —a state- 
ment whose falsehood allows of a broadly causal explanation. And 
the idea that such an explanation might be possible of why there are 
no numbers is not merely in tension with our ordinary conception of 
abstract objects as acausal but is demonstrably inconsistent with the 
belief that number theory is conservative with respect to nominalis- 
tic discourse.’® 

Even if the presumption of causal explicability can be defeated, 
there is no way round the point that to conceive of the existence of 
numbers as a strong contingency is a commitment to the view that 
nothing in a correct understanding of number theory or its underly- 
ing logic poses any obstacle to its truth. The idea would have to be, 
rather, that it is merely a feature of nature that, if it is, the world is 
numberiess. And to try to take that conception of the matter seri- 
ously is simply to point one’s chin at the original question: Why has 
nature turned out that way—what kind of consideration might in 
principle explain the nonexistence of numbers once they are ac- 
knowledged to be a strongly contingently nonexisting class of enti- 


Proof. Suppose that a causal explanation of the nonexistence of numbers 
could be given Then assuming that such an explanation will be of the familiar 
deductive-nomological pattern, there will be some nominalistically formulated 
laws (briefly, L) and nominalistic statement of initial or standing conditions 
(briefly, A) from whose conjunction the statement that there are no numbers 
could be derived. We may assume that the conjunctive premise is consistent, and 
that neither conjunct alone entails this conclusion. Let N be elementary number 
theory Then, since N entails that there are numbers, not—A is a nomunalistic 
statement entailed by N + L but not by L alone, whence—bearing in mnd Field’s 
definition (cf. note 4 above)}—wN is not conservative. 
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ties? And how can it be legitimate to think of the matter as, so to 
speak, merely natural, unless that question has some kind of con- 
structive answer? 
I. STRATEGY B 

The question can be avoided by reverting to strategy B. This, recall, 
involves granting that there is, in addition to any austerely logical 
sense of contingency like Field’s, a richer notion; and then claiming 
that while the existence or nonexistence of numbers is a contingent 
matter in the former sense, it is not so in the richer sense. The 
question—‘‘Why are there no numbers?”’—would thus admit of a 
direct answer: the nominalist could simply adduce whatever consid- 
erations show that numbers are impossible in the richer sense. 

The problem, obviously, is to locate a suitable richer notion—one 
that allows successful argument that numbers and other abstracta 
are indeed richly impossible. The simplest line, naturally, would be 
to attempt to maintain that the existence of numbers is conceptually 
impossible in the relatively straightforward manner of male vixens, 
or squares with unequal sides. On this view, all existentially commit- 
ted classical arithmetical theorems would conceal definitionally elici- 
table inconsistency. One difficulty with this is that it is crucial to 
Field’s project that arithmetic be formally consistent (else it cannot 
be conservative). And presumably its formal consistency should be 
sufficiently robust not to be disrupted merely by expansion of its 
statements in the light of correct definitions of certain of the con- 
cepts involved. But there seems, in any event, no prospect of making 
the necessary case, i.e., of a successful argument that while there is 
no overt formal inconsistency in, e.g., the (normally regarded as 
true) statement that 117 < 5, it can nevertheless, with the help of 
acceptable definitions of the specifically arithmetical terms involved, 
be made to give way to explicitly contradictory consequences, in the 
manner in which the statement that there are squares with unequal 
sides gives way to an explicit contradiction on replacement of the 
geometrical term ‘square’ by its normal definiens. 

If there is to be a conceptual impossibility here, it has to be of a 
subtler sort. Actually, it is hard to see how the alleged impossibility 
could be held to derive from anything but the very abstractness of 
numbers, so that the existence of any kinds of abstract object would 
have to be impossible. Of course, taken in one way, this would 
merely inherit the difficulties just glossed. For it seems equally im- 
plausible to suppose that the very concept of abstract object is self- 
contradictory after the fashion of male vixen or four-sided trian- 
gle. But perhaps it could be argued that the concept, though not 
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self-contradictory, is nevertheless defective in such a way as to guar- 
antee that it can have no instances. 

How? Field’s writings endorse passim a number of familiar anti- 
Platonist lines.’* In particular, he contends—in company with many 
others, of course—that the acausality and lack of spatiotemporal 
position of Platonistically conceived mathematical entities make for 
serious, if not insuperable, difficulties over knowledge of and refer- 
ence to them. He also endorses the well-known arguments against 
Platonism given by Paul Benacerraf’? and by Hilary Putnam.!* Un- 
less bolstered with some substantial further premise, however, none 
of these arguments will deliver the type of conclusion that strategy B 
requires, namely, that there can be no such objects as numbers, sets, 
etc. For instance, the proper conclusion of a cogent argument based 
upon some presumed causal constraint on knowledge would be 
merely that, if there are such objects as numbers, we can have no 
knowledge concerning them. Likewise, the upshot of the arguments 
—whether based upon some proposed causal theory of reference, 
or upon the kinds of considerations adduced by Benacerraf or Put- 
nam—purporting to establish that (determinate) reference to num- 
bers is impossible, is just that, i.e., that, if there are such things as 
numbers, there is no way we can make (determinate) reference to 
them. If these arguments are to yield the further conclusion that 
there simply cannot be any such objects at all, then further premises 
are needed, perhaps directly to the effect that no objects can exist of 
which knowledge is in principle impossible or to which there cannot, 
even in principle, be identifying reference, or perhaps yielding the 
desired result some other way. In any case, it is clear that proving 
the impossibility of numbers, or abstract objects generally, is a very 
tall order which no extant line of philosophical argument meets. If 
nominalists had such an argument, we would have heard about it. 

But if the case were somehow made, there would be a further 
issue, worth briefly airing. Field, remember, has no quarrel with 
Platonism as an account of our actual understanding of mathemati- 
cal language. And this literalism is an attractive and essential ele- 
ment in his kind of nominalism, distinguishing it from nominalist 
programs of the more traditional reconstructive (and failed) sort. 
What, in his view, is wrong with Platonism is not that it misrepre- 
sents our actual understanding of mathematical statements, or 


1$ Realism, Mathematics and Modality, pp. 68-70, also pp. 23 and 230f. 

17 “What Numbers Could Not Be,” Philosophical Review, LXXIV (1965): 47— 
73. 

18 “Models and Reality,” Journal of Symbolic Logic, XLV (1980). 464-82. 
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credits them with a metaphysically impossible content, but merely 
that it tells ontological untruths. But suppose we had the impossibil- 
ity proof that strategy B demands. What repercussions ought such a 
discovery, once it were generally known, to have on the semantics of 
mathematical statements? Could it be appropriate to continue to 
take the literalist view? 

It may seem that exactly this option—combining a literal con- 
strual with denying the possible existence of mathematical entities 
—is what is secured by going fictionalist.‘No doubt it would be 
absurd to represent a community as sincerely making mathematical 
claims referring to entities that they regarded as impossible. But a 
face-value construal of their mathematical language need not in- 
volve doing so, provided we can avoid interpreting the commitments 
that their mathematical assertions express as beliefs in the truth of 
those sentences. And that is just what mathematical fictionalism en- 
ables us to do. Fictionalism opens a gap between (i) acknowledging 
that, if the belief prevails in a community that there are, necessarily, 
no abstract objects, it cannot be part of the best semantic account of 
the community’s mathematical assertions to represent their makers 
as purporting reference to and quantification over such objects, and 
(ii) abandoning the face-value construal of the terms and quantifiers 
that their mathematical language contains. The gap is opened pre- 
cisely because fictionalism makes a distinction between endorsing a 
mathematical statement and accepting it as true. 

There is a residual concern, however. Only fictionalism, it seems, 
can reconcile three crucial ingredients in Field’s philosophy of math- 
ematics: (i) a face-value construal of the semantics of number-theo- 
retic language, (ii) the—in his view— justified disbelief that any ac- 
tual objects answer to the terms and quantifiers discerned therein, 
and (iii) our endorsement of the axioms and theorems of number 
theory. But the envisaged case, remember, is one in which we are 
supposed to be rightly persuaded not merely that there are not but 
that there could not be numbers—the fiction has to be one of ac- 
knowledged impossibilia. There will therefore be issues about the 
intelligibility of mathematical language, construed at face-value, 
which fictionalism as such does nothing to address. 

The interactions between intelligibility and the perception of 
various forms of conceptual impossibility are of course subtle. One 
might believe in the incoherence of the idea of time travel, for in- 
stance, yet still, to all intents and purposes, understand and enjoy a 
fiction in which such technology plays a central part. By contrast, 
many of us are persuaded of deep conceptual singularities in the 
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idea of the Cartesian ego as a temporal continuant—persuaded, 
e.g., that there is no coherent notion of what it would be for such an 
entity to endure through a given period of time. Would there never- 
theless be no difficulty about devising an intelligible fiction in which 
such entities were to be the central—repeatedly referred to and 
quantified over—characters? Again, assuming—plausibly—that 
there are conceptual obstacles in the way of interpreting an imag- 
ined community as genuinely operating with a deviant arithmetic, 
there is a real question concerning the intelligibility and coherence 
of a fiction in which the protagonists practice an arithmetic in which 
7+5 = 13, etc.” 

Obviously, in the absence of any clear indication of the kind of 
conceptual impossibility that a successful prosecution of strategy B 
might disclose, or the kind of considerations that might be involved, 
there has to be an element of shadow boxing about this. Neverthe- 
less, there are, unquestionably, kinds of conceptual incoherence 
that would jeopardize the intelligibility even of fictional reference 
and quantification. So there can be no a priori assurance that an 
impossibility “of a more subtle sort” which pursuit of strategy B 
might disclose will not have such an effect—no assurance that the 
strategy could be prosecuted without casualty to the semantical lit- 
eralism that distinguishes Field’s position from nominalism of more 


traditional kinds. 
O. LITERALISM AND THE STANDARD ANTI-PLATONIST ARGUMENTS 


That there is cause to doubt the coherence of literalism with other 
aspects of Field’s views may be brought out by recalling some of the 
considerations to which Field actually appeals as discrediting Pla- 
tonism—in particular, the allegedly irremediable indeterminancy 
disclosed by Benacerraf’s argument and the idea that their acausal- 
ity and lack of spatial position preclude any satisfactory account of 
how reference to numbers is possible. For these considerations, if 
cogent at all, seem by themselves quite sufficient to make trouble for 
a face-value construal of, e.g., number theory, even in the context of 
an overall mathematical fictionalism. If reference to numbers is 
ruled out on conceptual grounds—whether or not the further step 
is taken of concluding that the existence of the referents themselves 
is impossible—it is immediately cast into doubt whether the face- 
value construal adds up to a coherent semantic story, let alone a 
preferable one. Nominalism-cum-literalism has it that arithmetical 


ae we can readily make sense of a fiction in which the p poagn 
ed as more or less systematically, what would, by their 
lights, reckoned ical mistakes; but that is beside the point. 
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statements, as ordinarily understood, have determinate though unre- 
alized (or only vacuously realized) Platonistic truth conditions. But 
in order for any statement to have determinate truth conditions, 
there must be something correct, ergo coherent, to say about the 
reference of its singular terms, the satisfaction conditions of its pre- 
dicates, the range of its quantifiers, and so on—and no coherent 
account of these matters, in the nature of the case, can stand in 
violation of general conditions on the very possession of such se- 
mantical characteristics. A face-value construal of mathematical 
statements—even if offered in a fictionalist spirit—is bound to of- 
fend in this respect if it involves attempting to assign reference to 
the relevant terms in a way that clashes with deep intrinsic features 
of the reference relation itself; and just that, according to propo- 
nents of the Benacerraf and acausality arguments, is what it must 
involve if it represents those terms as purporting reference to ab- 
stract objects. 

To put the matter another way: one simply has not given a seman- 
tics for a discourse if the details of the proposed account controvert 
essential features of the semantical notions that provide its very 
framework. But, according to the arguments in question, to repre- 
sent, ¢.g., the numerals as having reference, if to anything, then to 
abstract objects, is precisely so to controvert essential features of the 
relation of singular reference. Somebody who accepts such argu- 
ments should therefore hold that, rather than deliver unrealized 
(perhaps necessarily unrealizable) truth conditions for the sentences 
of arithmetic, the face-value construal gives implicit offense against 
the semantic relations on which it needs to rely in order to say 
anything determinate about what the truth conditions of arithmeti- 
cal sentences are—and thus fails to deliver anything at all. 

If this is right, then any anti-Platonist driven by the familiar argu- 
ments cited approvingly by Field is left, so far as we can see, with just 
three options. The first is traditional nominalism, according to 
which the appearance of singular reference and quantification pre- 
sented by mathematical discourse may be genuine, provided the ob- 
jects referred to and quantified over are concrete. The second is a 
reductive program to disclose that appearance as spurious, and elim- 
inate it. The third is to accommodate the appearance in the only 
other way that seems open—to go formalist and reject altogether 
the idea that mathematical statements are bona fide contentful state- 
ments having genuine truth conditions. On the latter view, such 
statements possess a determinate syntax, which may be exploited in 
formal derivations, but no semantics—a fortiori, not a Platonis- 
tic one. 
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What is not apparently an option, for anyone driven by the famil- 
iar arguments, is Field’s own view, or any view that shares the idea 
that mathematics, while a fiction, is a Platonistic fiction.” 

IV. BRUTE CONTINGENCIES 

Let us see where we have arrived. The literalist can avoid Platonism 
only at the cost of denying that mathematical theories are true. The 
question is then unavoidable: Do such theories fail of truth—specifi- 
cally, do mathematical singular terms fail of reference—necessarily 
or merely contingently? The preceding suggests that the former op- 
tion, in effect that pure mathematics is a fiction of impossibilia, is 
not the way for a supporter of Field to go. It remains to reconsider 
the alternative, that mathematical and abstract objects generally ex- 
ist or fail to exist by grace of strong though inexplicable contin- 
gency—precisely the position targeted by the main objection. 

The objection could be neutralized if it could be shown that the 
demand for explanation is here illegitimate; that there are, in any 
case, certain strongly contingent matters—matters to which the case 
of mathematical objects may property be compared—of which, how- 
ever they are resolved, there is no conceivable explanation. “No 
conceivable explanation” because not every contingency need, of 
course, be explicable in fact. For instance, if only subsumption 
under ulterior physical laws can explain the obtaining of any particu- 
lar physical law, then it will be vain to seek an explanation of any 
fundamental physical law, if such there be. Still, there will be con- 
ceptual space—as it happens unoccupied—for an explanation of 
such a law; (for the law might not have been fundamental). By con- 
trast, someone who wants to dismiss the question—‘‘Why are there 
no numbers?”—as inviting an explanation of a matter that is actu- 
ally a brute contingency, will need to meet the point that there 
seems to be no corresponding conceptual space—we seem to have 
no conception of how an answer might run in principle. 

Pursuing this direction will involve a two-stage operation: first, a 
search for other relevant examples of brute contingency—contin- 
gencies where we conceive not merely that no explanation need be 
possible of the direction in which it is resolved but, more, that there 
simply is no category of explanation that might in principle serve; 
and second, an argument that the existence of numbers, and of 
abstract objects generally, belongs with this group. What are the 
prospects? 

® Doubts along the lines of the last few are aired by Hale in “Nomi- 


nalism,” in A. D. Irvine, ed, Physicalism in Mathematics (Cambridge: Kluwer, 
1990), pp. 136-7 
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In correspondence, Field has remarked: 


On the question of why there are no numbers, my view is that this is 
like why is there matter, or why are there no immaterial minds, or why 
is there no God: you simply can’t expect there to be an answer to such 
basic existence and non-existence questions (“basic’ in the sense of: 
applied to a whole ‘ontological category’).”" 
This is a very natural thought, and it is therefore important to see 
that it comes to not very much. For unless the claim is to be simply 
dogmatic, it needs the backing of an account of “basic ontological 
category” which both discloses why explanations of “basic” matters 
of existence and nonexistence are indeed not to be expected and 
also ensures that numbers, and abstract objects generally, turn out 
to be a basic ontological category in the relevant sense. But it is not 
clear how such an account might run. Indeed, it is obscure what 
intuitive conception of a basic category covers each of the cited 
examples. Notice that certain quite attractive proposals will not sub- 
serve Field’s thought in any case. Persons, and living organisms, will 
both be basic on many accounts, including—though its author ex- 
presses some uncertainty—the largest-class-of-objects-sharing-crite- 
ria-of-identity idea of Michael Dummett.” But presumably it is not 
absurd to seek explanations both of why there are any persons and 
why there is life. 

There are, however, other cases which may be held to exhibit 
brute contingency, and so to provide the model Field needs. Con- 
sider the ageless conundrum—“‘Why is there something, rather 
than nothing?” An unanswerable question if physical existence is at 
issue and a satisfactory explanation why a physical state of affairs 
obtains has to advert to a causally prior situation in which it does not 
obtain. For a physically empty world would presumably have no 
causal progeny. Or consider the “great contingency”—the global 
state of affairs that verifies all actually true contingent proposi- 
tions.” Since the obtaining of each of its ingredients is a matter of 
contingency, the exact character of the global state is no doubt itself 
a contingency. But there can be no explaining its exact character— 
at least not if contingencies can be explained only by reference to 
further contingencies. For there are, precisely, no further contin- 
gencies to appeal to; everything further is a matter of necessity. 


2 Letter of September 8, 1990. 
2 Frege: pend ts of Language (London: Duckworth, 1973), pp. 75-6. 
2 Here we are ted to Peter Carruthers and Gideon Rosen for suggestions 
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Do such examples offer any comfort to a supporter of Field? To 
stress: it will not do just to observe that a certain sort of brute 
contingency is illustrated by examples of this sort, and is therefore 
at the call of the nominalist according to convenience. The crux will 
be whether “There are no numbers,” or “There are no abstract 
objects,” comes in the same rather special sort of category. And 
argument is required. The original objection was that the concept of 
contingency has no proper application to the cases in question, on 
the ground that contingencies invite the question—‘Why?”—and 
that we have, in the cases in question, no concept of the form a 
satisfactory answer might assume. The onus is therefore on the nom- 
inalist to show that this is a bad reason—that the correct account of 
our lack of any such concept is not that the idea of contingency is 
being stretched beyond the limits of its intelligible application, but 
that—as in the case of the two mooted examples—explanation is 
here pre-empted in perfectly intelligible ways, quite compatibly with 
the claim of contingency. 

Reflect on what would be demanded by a serious analogy with 
those two examples. There the essential work is done by certain 
restrictive principles: roughly, 

The explanation of the obtaining of a (physical) state of affairs must 

advert to a causally prior state of affairs in which it does not obtain. 


and 


The explanation of a contingency must proceed in terms of further 

contingencies. 
The key point is that such restrictions pre-empt explanation as soon 
as something sufficiently general is put up as the explanandum, and 
that this—rather than any misappropriation of the notion of contin- 
gency—is at the root of our inability to see how a satisfactory expla- 
nation of the contingencies in question might proceed. Accordingly, 
to establish a strict parallel, the nominalist needs to call attention to 
a restriction on explanations of mathematical existence which has a 
corresponding effect: our inability to explain the existence, or non- 
existence, of numbers, or abstract objects generally, should be 
traced to the fact that the demand to do so implicitly transgresses 
bounds within which a satisfactory explanation would have to work. 

The concept of mathematical explanation is a difficult one. But 
we can at least take it that explaining pure mathematical facts is 
always a matter of giving proofs, even if not all proofs are explana- 
tory. That might suggest a suitable line for the nominalist. Mathe- 


NOMINALISM 129 


matical proofs, it might be argued, accomplish demonstrations, and 
therefore explanations, only within a framework supplied by the 
basic assumptions, including assumptions of existence, made by the 
relevant mathematical theories. Mathematical why-questions, the 
suggestion continues, are internal questions: grant the framework 
and ontology presupposed by the Peano axioms, and we can explain 
the infinity of the primes; grant the framework and ontology of 
Zermelo-Fraenkel set theory and we can—or so it is sometimes 
thought—explain the truth of the Peano axioms. But just as in the 
ascent to “Why is there any physical substance at all?” or “Why is 
the totality of contingencies composed just the way it is?” escalating 
generality of this sort must sooner or later clash with-the possibility 
of explanation. It has done so by the time we ask why there are (no) 
mathematical objects. 

Although this may seem to pass muster, on closer inspection the 
analogy limps at the crucial point. What exactly is the restrictive 
principle in play? Presumably, something like, 

The explanation of any matter of mathematical existence or nonexis- 

tence can only be accomplished within a framework in which the exis- 

tence of a range of mathematical objects is assumed. 

But that principle overlooks an asymmetry in the character of mathe- 
matical explanations of existence and nonexistence, respectively. No 
doubt it is only if the existence of the natural numbers is granted 
that someone can set about trying to explain why the ratio of primes 
< n becomes smaller and smaller as n increases. But what about the 
explanation offered by Euclid’s proof of the nonexistence of a great- 
est prime? Here it is inessential to the success of the explanation to 
presuppose that there really are numbers. For the proof succeeds in 
any case: if the natural numbers exist, then, by Euclid’s reasoning, it 
is in their nature that there is no finite limit to the occurrence of 
primes; and if they do not, then there are no prime numbers and, a 
fortiori, no greatest among them. 

The point is perfectly general. Mathematical explanations of non- 
existence proceed, without existential presupposition, by showing 
that the corresponding existence claim clashes with essential aspects 
of our concept of the type of mathematical objects concerned, that 
the concept’s being fully and properly exemplified would be incon- 
sistent with the truth of the existence claim. There is no range of 
objects demanded by all such explanations, whose existence or non- 
existence is thereby pushed beyond explanation. The candidate re- 
strictive principle is false. 
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No firm conclusion can be drawn, of course, on the basis of this 
one failed comparison. But is there a better one? The heart of the 
objection is the claim that, while there is no simple connection be- 
tween contingency and the possibility of explanation, there is never- 
theless a presumptive connection: explicability is the default status, 
and the exceptions have to be explicable exceptions—something 
has to be done to show why ordinary explanation is precluded. The 
implicit principle, P, is that, for any putative (strong) contingency C, 
it should be possible either 

(a) to indicate at least a category of explanation that an explanation of 

C might in principle instantiate—to gesture at the lines along 
which an explanation of C might run, even if as a matter of fact no 
correct explanation of that kind can be given; or 

(b) to explain why there cannot be any such category—why any expla- 

nation of C would be required, by virtue of certain special features 
of the case, to meet unsatisfiable conditions. 


And the backing for the principle is merely the natural idea that 
since, when it is a (strong) contingency that C, things might have 
been otherwise, it ought to be perfectly appropriate to ask why they 
are not otherwise, i.e., why C. Since this question is appropriate only 
if a suitable category of explanation exists, the onus has to be on 
someone who accepts that no such category exists but who is deter- 
mined that C is nevertheless a (strong) contingency, to show why, in 
this special case, all space for explanation is closed off, i.e., to satisfy 
the second disjunct of the principle. This option can be nicely ac- 
complished in the case of the two examples reviewed. But we have so 
far no clear suggestion how it might be accomplished for the puta- 
tive contingency that there are no numbers.** 

All this, however, is undeniably inconclusive. Can we conjure a 
wider class of examples that may help to place the putative contin- 
gency of mathematical objects in a more luminous perspective? We 
already have 


(1) The “great contingency”; 

(2) “Why is there anything material?’ —the Conundrum; 

(3) Any fundamental scientific law, L; 

(4) The existence of God—or, presumably, of any Being whose con- 
cept, like that of the Christian God, requires that its existence de- 


pend on nothing else. 


™ It is important to remember, moreover, that, since we are at this point con- 
cerned with a theorist who regards the strong contingency of numbers as the 
correct account of their modal status, hence as something imposed by an ordinary 
understanding of number—it has to be possible to draw on agreed aspects of that 


understanding to discharge the obligations of course (b). 
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Other examples do occur. Suppose physics suggests, for instance, 
that the universe is approximately n years old. Then the question 
(5) Why did the universe begin approximately n years ago? 

will admit of no causal explanatory answer since it presupposes that, 
prior to that period, there was no physical process—hence nothing 
to determine that process should start then rather than earlier. Or 
consider an experiment in which under radioactive decomposition a 
particle escapes through one slit in a screen rather than another, 
and suppose that—so we conceive—activity at this micromaterial 
level is subject only to statistical—indeterministic—law. Then there 
need be no answer to the question 


(6) Why did the particle pass through the top slit? 

No doubt such examples put paid to any lingering attraction in the 
simple thought that, for any bona fide contingency, there has to be 
something on which it is contingent. But collectively they merely 
reinforce the presumptive connection asserted by the principle, P. 
For im each case we can either indicate quite definite lines along 
which explanation might have been possible—L might have been 
subsumable under ulterior physical law, for instance, or the laws 
governing the relevant level of micromaterial activity might have 
been deterministic—or else, as in cases (1), (2), (4), and (5), we can 
point to clear conceptual considerations which, drawing just on our 
ordinary understanding of the example, preclude explanation in 
principle. 

But there are more awkward customers. Consider any contin- 
gency, C, for which, as in cases (3) and (6), we possess a general 
conception of how an explanation might in principle proceed but 
where no correct explanation of that sort can actually be given. 
Then the circumstance—call it C*—that there is no explanation of 
C will itself be a contingency, and it will be liable to be brute. For 
how might an explanation of C* go? Not a priori, presumably, since 
C* is a contingency—there might have been an explanation of C. 
But what sort of empirical explanation might be given of the contin- 
gent absence of all empirical explanation of a given (as it happens 
fundamental) law, or of the (as it happens) indeterministic character 
of some range-of micromaterial events? Thus, for any C of the type 
illustrated by (3) and (6), the corresponding contingency of the form 
of C* already poses a problem to the taxonomy implicit in P—tt will 
be, prima facie, a case where it is clear neither what a suitable cate- 
gory of explanation might be nor that there can be, a priori, no such 
category. Thus we can add to the list, for any fundamental law L, 
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(7) There is no explanation of L’s obtaining. 
and likewise, in respect of example (6), 


(8) There is no explanation of the particle’s passing through the 

top slit. 

More generally, for any of a wide range of physical phenomena 
which we can seemingly readily conceive might have been otherwise 
and yet of which our best current conception is that they have no 
physical preconditions—the whole phenomenon of gravitational at- 
traction, for instance, or space’s being non-Euclidean, or n-dimen- 
sional; in all such cases, the claim that there is no explanation of the 
phenomenon will seemingly be itself a contingency about which we 
are clear neither along what general lines an explanation might be 
given nor about what if any conceptual considerations preclude ex- 
planation in principle. 

Well, are not these reflections enough to dislodge principle P? Do 
we not have to admit now that the intuition of contingency can 
perfectly justifiably be robust in circumstances where we are unclear 
whether either limb of principle P can be satisfactorily made out? Of 
course, these, as it were, second-order contingencies are of a special 
sort, and bear no immediate analogy to the putative contingency 
that there are no abstract objects. Still, the principle that drives the 
objection is not, it appears, exceptionless in any case. So what cause 
for skepticism if the nominalist postulates a further class of ex- 
ceptions? 

In an earlier discussion Wright wrote: 


... the idea of a contingent state of affairs that is contingent on noth- 
ing, admits of no evidence even in principle, and is beyond anything we 
could justifiably regard as cognition, is simply not a credible piece of 
metaphysics... .% 
Focusing, as we have been, on the question of explanation belongs 
with the thought that bona fide contingencies are presumptively 
contingent on something, and that the exceptions should be few and 
clearly explicable as such. But we should not lose sight of the second 
component in the quoted complaint—the utter epistemic isolation 
of the existence or nonexistence of mathematical objects, when con- 
strued as a (strong) contingency. Moreover, it is important to be 
clear that the offense given thereby need not depend on any implicit 


3 Wright, “Why Numbers Can Believably Be,” p. 465; cf. also Hale, Abstract 
Objects, pp. 111-2. 
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verificationism—any general aversion to thé idea of evidence-tran- 
scendent fact. The reason there can be no cognition of, nor evi- 
dence for or against, the existence of mathematical objects, con- 
ceived as a strong contingency, is because nothing whatever—in 
particular, no uncontroversial contingency—varies as a function of 
their existence or nonexistence, save precisely that.” It is merely 
one consequence of this that their existence or nonexistence exerts 
no differential influence on what is cognitively possible. The con- 
ception that mathematical objects exist or not as a matter of strong 
contingency thus clashes not merely with the presumption that con- 
tingencies should be contingent on something but, equally, with the 
converse principle that bona fide contingencies are presumptively 
things on which other bona fide contingencies are contingent. 

There is, of course, a vast range of ordinary unproblematic con- 
tingencies that respect both these conditions. That is the normal 
case. But what is striking is that all the examples we have consid- 
ered, even those which prima facie make trouble for P, respect the 
converse principle. There are manifold uncontroversial contingen- 
cies that would not be so but for the detail of the “great contin- 
gency,” the existence of matter, the obtaining of a particular scien- 
tific law, the universe’s being n years old, space’s being n-dimen- 
sional, the particle’s going through the top slit (it leaves traces), 
God’s existing (if he does), L’s being fundamental (for if L were not 
fundamental, other superior laws would obtain and would be mani- 
fest in differential behavior of matter), and there being no explana- 
tion why the particle went through the top slit (for if there were an 
explanation, the relevant class of phenomena would not be indeter- 
ministic and all sorts of things would be different). 

The suggestion, then, is that both the following presumptive prin- 
ciples constrain the ascription of strong contingency: 


(1) That a putative contingency will be contingent on something— 
there will be circumstances on which its obtaining depends (and 
whose citation can thus contribute to its explanation). 

(2) That a putative contingency will have things contingent on it— 
there will be circumstances whose obtaining depends on it (would 
be otherwise if it did not obtain.) 

Brute contingencies are counterexamples to the first. Call a contin- 
gency barren, by contrast, if it is a counterexample to the second. 
Then none of the brute contingencies reviewed in our list of eight 


t° If this were not so, then mathematical theories would not be conservative 
with respect to nominalistically stateable consequences. 
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seems to be barren. Uncontroversial examples of barren contingen- 
cies are, indeed, hard to muster. The occurrence of a mental state or 
episode, viewed in the special way characteristic of epiphenomenal- 
ism, would be one possible type of example. But even that would not 
also, of course, be brute. What is striking about “There are (no) 
numbers,” viewed as a strong contingency, is that, at least with re- 
spect to all other uncontroversially contingent matters, the contin- 
gency it putatively expresses is absolutely insular—is both barren 
and brute. We conjecture that there is no other example of a contin- 
gency—at least, no case that would pass without controversy as a 
contingency—that has both those features. 

It would be satisfying to disclose some definite metaphysical inco- 
herence in the notion that there can be absolutely insular contin- 
gencies. At the very least, it is at odds with the idea that the realm of 
contingency forms a single integrated system—a tree-like structure 
im which the connecting links are dependencies and in which every 
node is linked to others. There is a potential place in such a struc- 
ture for initial nodes—nodes ancestrally dependent on no others, 
which thus correspond to brute contingencies—and for terminating 
branches, whose terminal nodes thus correspond to barren contin- 
gencies. What there is no place for is a node that is both initial and 
terminal; for by forgoing any relations of dependency, such a node 
forfeits any case to be regarded as part of the structure at all. 

What should we conclude? The picture of the realm of contin- 
gency as comprising such a unified web seems to go deep enough in 
our ordinary thinking to ensure that it will be very difficult to vindi- 
cate the conception of mathematical objects as strongly contingent 
merely by description of anything we already find intuitive. But it is 
hard to avoid feeling that there is a deeper incongruity in any play 
with absolutely insular contingency, yet to be brought out. At the 
very least, a supporter of Field who goes in this direction has to 
swallow some very lumpy metaphysics for the sake of his ontological 
conscience. 

The main purpose of this paper has been to raise the question 
whether someone who takes a nominalist-cunrliteralist position has 
the resources coherently to classify the modal status of the proposi- 
tion that there are numbers, or abstract mathematical objects gener- 
ally. By contrast, the form of deflationary Platonism—Fregean Pla- 
tonism, or “Platonism for cheap,” as Field styles it—whose claims 
we have each defended elsewhere, does, we believe, provide a coher- 
ent and coherently argued view: it is that the existence of numbers is 
a necessity, whose truth may be recognized a priori. But the further 
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defense of that contention is for another occasion: in particular, we 
ought to stress that the argument of this paper is not, and is not 
meant to be, a positive argument for this, or any other, version of 
Platonism.”” On the contrary, it relies upon no assumption that 
could not be endorsed by an adherent of classical reductive nominal- 
ism, formalism, or—so far as we can see—virtually any other posi- 
tion that stands opposed to the unorthodox variety of nominalism 
against which it is directed. 
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t Not any form of antiPlatonism but only views which 
allow our things: 

G) That the sentences of number theory, analysis, etc., have sufficiently 
determinate truth conditions to give sense to the question whether they 
are satisfied or not; 

Gi) that those truth conditions are correctly reflected by the overt syntax of 
the sentences in question; 

(ii) that those truth conditions are systematically unrealized; and 

(tv) that they are so as a matter either of conceptual necessity or strong 

contingency. 
Ron an hil grr geada ai pen phair E eee o E 
paper fully succeeds. Our discussion, if fo iee at the cet widely Piada 
the most widely influential 
anti-Platonist arguments—in particular, those derivmg from causal accounts of 


singular 

of mathematical singular terms—is (or o to be) to call one or more of these 
aie pete into question. himself, for instance, rejected clause 

kit; section II argued in effect that one who holds that reference to abstract 

objects is necessarily precluded by their causal inertness cannot coherently en- 

dorse both (i) and (ii). 
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NIMAL cognition and desire, from the appetite of a clam to 

the optical systems of vultures and frigate birds, is supposed 

to have neurobiological explanations resultant from, if not 
reducible to, universal laws of physics. That is a minimal and modest 
project for epistemology naturalized, one to be assisted by special- 
ized sciences.! 

There is a larger and bolder project of epistemology naturalized, 
namely, to explain human thought in terms available to physical 
science, particularly the aspects of thought that carry truth values, 
and have formal features, like validity or mathematical form. That 
project seems to have hit a stone wall, a difficulty so grave that 
philosophers dismiss the underlying argument, or adopt a cavalier 
certainty that our judgments only simulate certain pure forms and 
never are real cases of, e.g., conjunction, modus ponens, adding, or 
genuine validity. The difficulty is that, in principle, such truth-carry- 
ing thoughts* cannot be wholly physical (though they might have a 
physical medium),* because they have features that no physical thing 
or process can have at all.* 


! After three centuries of amazingly successful science, we do not have a success- 
ful explanation of animal cognition, not even for a spider or a fish. Probably, we 
have been misconceiving the project in ways that makes science both less produc- 
tive and less helpful. 

* Thinking here means “judgmental understanding”—what Aristotle thought 


now trying to cannot be wholly physical; understandings that involve feeling 
cannot be entirely nonphysical either, any more than my going for a walk can be a 


Aquinas's commentary in Aristotle’s De Anima in the Version of William of 
Moerbeke ee ee Aquinas, Kenelm Foster and 
: Yale, 1959 .), sec. 684-6, pp. 406— 
7) that the understanding cannot have an o aa sight bas the ee ( nowa- 
See poe hinkin has: the in), because the limited physical 
states of an organ fall short of the contrasting states of understanding that 
we know we can attain. 
* Philosophers should not recoil with distaste at such remarks about thought, 
because attribute even odder features to propositions, e.g., being infinite in 
number, belonging to a tight logical network with formal features like “excluded 
middle,” and being such that every one is i either logically related, by 
implication or exclusion, or logically independent o every other; in fact, in a 
imp 
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I propose to articulate that “difficulty in principle” so as to press 
home the point that it cannot be dismissed or evaded, or the under- 
lying arguments or costs disregarded. First, the underlying argu- 
ments themselves are among the jewels of analytic philosophy (un- 
derdetermination considerations); and, secondly, to deny that our 
judgments are of definite logical forms and pure functions conflicts 
with our own certainty and with what we tell our logic, mathematics, 
and linguistics students about validity, proof, and formal syntax, and 
leaves us unable to explain what we do when we do mathematics, 


logic, or any other formal thinking. 
But now let us look at the argument: 


Some thinking (judgment) is determinate in a way no physical process 
can be. Consequently, such thinking cannot be (wholly’) a physical pro- 

_ cess. If all thinking, all judgment, is determinate in that way, no physi- 
cal process can be (the whole of) any judgment at all. Furthermore, 
“functions” among physical states cannot be determinate enough to be 
such judgments, either. Hence some judgments can be neither wholly 
physical processes nor wholly functions among physical processes. 
Certain thinking, in a single case, is of a definite abstract form 

(e.g, N X N = N°’), and not indeterminate among incompossible 

forms (see I below). No physical process can be that definite in its 

form in a single case. Adding cases even to infinity, unless they are 
all the possible cases, will not exclude incompossible forms. But 
supplying all possible cases of any pure function is impossible. So, 
no physical process can exclude incompossible functions from being 
equally well (or badly) satisfied (see II below). Thus, no physical 
process can be a case of such thinking. The same holds for functions 
among physical states (see IV below). 

I. THE DETERMINATENESS OF SOME THOUGHT PROCESSES 
Can judgments really be of such definite ‘‘pure” forms? They have 
to be; otherwise, they will fail to have the features we attribute to 
them and upon which the truth of certain judgments about validity, 
inconsistency, and truth depend; for instance, they have to exclude 
incompossible forms or they would lack the very features we take to 
be definitive of their sorts: e.g., conjunction, disjunction, syllogistic, 
modus ponens, etc. The single case of thinking has to be of an 
abstract “form” (a “pure” function) that is not indeterminate 
among incompossible ones. For instance, if I square a npumber—not 


5 But in part, yes, in the sense that my utterances are physical. Moreover, the 
thought may not even be apart from feeling or sense, just as a gesture is 
not possible without movement. The target in this paper is theories that 
thoughts are “no more PRR ie functions determined physically, not 
that, for us, they are “at least i realized.” 
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Just happen in the course of adding to write down a sum that is the 
square, but if I actually square the number—I think im the form “N 
x N = N?.” 

The same point again. I can reason in the form, modus ponens 
(“Lf p then g’; “p”; “therefore, q”). Reasoning by modus ponens 
requires that no incompossible form also be “realized” (in the same 
sense) by what I have done. Reasoning in that form is thinking in a 
way that is truth-preserving for all cases that realize the form. What 
is done cannot, therefore, be indeterminate among structures, some 
of which are not truth preserving.’ That is why valid reasoning can- 
not be only an approximation of the form, but must be of the form. 
Otherwise, it will as much fail to be truth-preserving for all relevant 
cases as it succeeds; and thus the whole point of validity will be lost. 
Thus, we already know that the evasion, “We do not really conjoin, 
add, or do modus ponens but only simulate them,” cannot be 
correct. Still, I shall consider it fully below. 

“Being truth preserving for all relevant cases” is a feature of the 
single case. The form of the reasoning that actually occurs is 
“truth-preserving,”’ regardless of which case it is. Otherwise, it 
would not be “impossible by virtue of the form to proceed from 
truth to falsity” in that reasoning (especially when the premises are 
not true). Thus, the form of the actual “encompasses” (logically 
contains) all relevant counterfactual situations. In fact, it encom- 
passes all relevant cases whatever. Without that, there is no genuine 
difference between valid and invalid reasoning. 

Squaring, conjoining, adding. I propose with some simple cases 
to reinforce the, perhaps already obvious, point that the pure func- 
tion has to be wholly realized in the single case, and cannot consist 
in the array of “inputs and outputs” for a certain kind of thinking. 
Does anyone doubt that we can actually square numbers? “‘4 times 4 
is sixteen”; a definite form (N X N = N?) is “squaring” for all 
relevant cases, whether or not we are able to process the digits, or 
talk long enough to give the answer. To be squaring, I have to be 
doing something which works for all the cases, something for which 
any relevant case can be substituted without change in what I am 
doing, but only in which thing is done. 

Size and length of computation, for example, are external to the 
form of thinking, accidental to what is done. I am squaring just in 


®T am not, of course, that a valid course of reasoning is not also a 
case of a variety of invalid forms, e.g., “P, therefore, C.” But it must determina- 
tely be a case of some valid form. 
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case my thinking is of the form mentioned: If it is of any incompos- 
sible form, or is indeterminate among incompossible forms, it is not 
of the form, “N times N = N squared.” It is not then squaring, 
however much its products may look like it, and however long a 
sequences of its outputs do. 

The fact that I cannot process every case of modus ponens, be- 
cause most of them have premises too long for me to remember, 
sentences too long to say, or words I do not understand, is adven- 
titious, like my not being able to do modus ponens in Portuguese. 
Those are features of the functors, not of the function. The func- 
tion that has to be realized in every case is the one wholly realized in 
the single case. 

That point is to be taken literally: that the function is wholly pres- 
ent, not by approximation, exemplification, or simulation, but by 
realization in the single case. To make that distinction clearer, con- 
sider an even simpler function, ‘“‘conjoining.” Conjoining is the 
functional arrangement of an n-tuple of assertions into a single as- 
sertion that is determinately true just in case every one of the n-tu- 
ple of judgments is, and false otherwise. The truth of the whole 
block is the truth of all of the units (“p q = T just in case p = T and 
q = T”). I can conjoin every sentence in the fourteenth edition of 
the Encyclopedia Britannica, or yesterday’s Times. What I do in the 
single case is what would conjoin any string of suitable units, even 
ones too long for me to think of, or beyond my access to refer to. It 
is impossible to conjoin thoughts, if what I do is indeterminate 
among incompossible forms (at the same level). 

Adding—genuinely adding, not estimating—is a sum-giving 
thought form for any suitable array of numbers.’ If I add two 
“elevens,” I am doing what would have given “forty-four” had I 
been adding two “twenty-twos” (and not making mistakes), and so 
on for every other combination of suitable numbers. I cannot be 
really adding when I do something which gives the “‘right output” 
but which cannot, by its form, determine the “right outcome” for 
any case whatever, even one on which I make a mistake. There is a 
great difference between adding incorrectly and doing something 
else, like guessing, estimating, or following a routine or algorithm. 


cannot conjoin all the sentences of English. See note 10. me 
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The adding I am talking about, like conjoining, is a form of under- 
standing. 

This is not a claim about how many states we can be in. This is a 
claim about the ability exercised in a single case, the ability to think 
in a form that is sum-giving for every sum, a definite thought form 
distinct from every other. When a person has acquired such an abil- 
ity is not always transparent from successful answers, and it can be 
exhibited even by mistakes. 

Definite forms of thought are dispositive for every relevant case 
actual, potential, and counterfactual. Yet the “function” does not 
consist in the array of inputs and outcomes.® The function is the 
form by which inputs yield outputs. The array of inputs and outputs 
for a function is the logical tail of the comet, not what the func- 
tion is.? 

The trait that determines the tail of the comet, the trait that 
“settles every relevant case, including all countercases,” marks the 
contrast with any physical process: a physical process has no feature 
that can do that. That grounds my main argument: that a necessary 
consequence of even a single case of such thinking is something that 
is logically impossible to be a consequence of any physical process, 
or function among physical processes, whatever. Thus, the activity 


of such thinking cannot be a physical process, and the ability for - 


such thinking cannot be a physical capacity. 
II THE INDETERMINACY OF THE PHYSICAL 

Now we need reasons why no physical process or function among 
physical processes can determine “the outcome” for every relevant 
case of a “pure” function. Those considerations mark some of the 
most successful analytic philosophy, from W. V. Quine, to Nelson 
Goodman, to Saul Kripke. No physical process is so definite as to 
determine among incompossible abstract functions that one rather 
than another is realized, and thus to settle for every relevant case 
what the “outcome” is to be. That indeterminacy remains no matter 


® We can even add certain nonterminating decimals, like 33333 and .66666 
ing from infinity to get 1. That is a form of understanding. 
s t but nonsynonymous functions would give the same arrays from 
inputs to outputs. Besides, a device that went to an for the answer, and 
took it out in an envelope (encoded), which it did not open (decode) but 
to you (displayed for you to decode), could be made to produce the same y of 


outputs as addition Yet it would not be addi . Besides, look at this function: 10 
Z = X*X*X; 20 Pnnt Z; 30 X = X + 1; 40 10. That is a machine function 
for an endless loop to print the cube of every number beginning with zero. You 


can see that no matter what outputs the machine gives, it might have been doing 
other than printing successive cubes, unless it produces all cCubes—. 
which cannot be done. 


~ 
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how long the physical process is “repeated,” even infinitely. In a 
word, with a machine it is indeterminate among incompossible func- 
tions what it is doing, no matter what it does.'® Therefore, no matter 
what it does, what it is doing remains formally indeterminate. Good- 
man’s!! “grue” considerations and the plus-quus adaptations by 
Kripke’? suggest the form of my argument to show that. The argu- 
ment is as follows. 

Whatever the discriminable features of a physical process may be, 
there will always be a pair of incompatible predicates, each as empir- 
ically adequate as the other, to name a function the exhibited data 
or process “satisfies.” That condition holds for any finite actual 
“outputs,” no matter how many. That is a feature of physical pro- 
cess itself, of change. There is-nothing about a physical process, or 
any repetitions of it, to block it from being a case of incompossible 
forms (“functions”), if it could be a case of any pure form at all. 
That is because the differentiating point, the point where the behav- 
ioral outputs diverge to manifest different functions, can lie beyond 
the actual, even if the actual should be infinite; e.g., it could lie in 
what the thing would have done, had things been otherwise in 
certain ways. For instance, if the function is x(*)y = (x + y, if 
y < 109m7, = x + y + 1, otherwise), the differentiating output 
would lie beyond the conjectured life of the universe. 

Just as rectangular doors can approximate Euclidean rectangular- 
ity, so physical change can simulate pure functions but cannot real- 
ize them. For instance, there are no physical features by which an 
adding machine, whether it is an old mechanical “gear” machine or 
a hand calculator or a full computer, can exclude its satisfying a 
function incompatible with addition, say, quaddition (cf. Kripke’s 


10 Postulating an infinity of cases will not suitably discriminate the functions 
that are the same for even numbers but differ for odd numbers after N. Postulat- 
ing that “all” the cases are actual involves an incoherent totality, because the 
machine cannot both do all that it does and all that it might have done instead. 


Consequently, a pure function does not reduce to a pattern of inputs and out- 


puts. 

“AI the additions” is as incoherent as “all the sets.” So “what” addition is 
cannot be explained by “all the outcomes”: rather, each and every outcome is 
determined by what addition is. It is impossible that all cases of addition be actual, 
even if infinities are performed because, even if we used up all the suitable num- 
bers, the function itself would still be repeatable, say, for the same additions, but 
now done in a different order. The function cannot be exhausted by its cases, 
however many there are. 

11 See “The New Riddle of Induction,” in Fact, Fiction and Forecast, 2nd ed. 
(Indianapolis: Bobbs-Merrill, 1968), pp. 63-86. 

12 Wittgenstein on Rules and Private Language (Cambridge: Harvard, 
1982), p. 9. and passim. 
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definition (op. cit., p. 9) of the function to show the indeterminacy 
of the single case: quus, symbolized by the plus sign in a circle, ‘‘is 
defined by: x © y = x + y, if x, y < 57, =5 otherwise”) modified so 
that the differentiating outputs (not what constitutes the difference, 
but what manifests it) lie beyond the lifetime of the machine. The 
consequence is that a physical process is really indeterminate 
among incompatible abstract functions. 

Extending the list of outputs will not select among incompatible 
functions whose differentiating “point” lies beyond the lifetime (or 
performance time) of the machine. That, of course, is not the basis 
for the indeterminacy; it is just a grue-like illustration. Adding is not 
a sequence of outputs; it is summing; whereas if the process were 
quadding, all its outputs would be quadditions, whether or not they 
differed in quantity from additions (before a differentiating point 
shows up to make the outputs diverge from sums). 

For any outputs to be sums, the machine has to add. But the 
indeterminacy among incompossible functions is to be found in 
each single case, and therefore in every case. Thus, the machine 
never adds. 

Extending the outputs, even to infmity, is unavailing. If the ma- 
chine is not really adding in the single case, no matter how many 
actual outputs seem “right,” say, for all even numbers taken pair- 
wise (see the qualifying comments in notes 7 and 10 about incoher- 
ent totalities), had all relevant cases been included, there would have 
been nonsums. Kripke drew a skeptical conclusion from such facts, 
that it is indeterminate which function the machine satisfies, and 
thus “there is no fact of the matter” as to whether it adds or not. He 
ought to conclude, instead, that it is not adding; that if it is indeter- 
minate (physically and logically, not just epistemically) which func- 
tion is realized among incompossible functions, none of them is. 
That follows from the logical requirement, for each such function, 
that any realization of it must be of it and not of an incompos- 
sible one. 

There is no doubt, then, as to what the machine is doing. It adds, 
calculates, recalls, etc., by simulation. What it does gets the name of 
what we do, because it reliably gets the results we do (perhaps even 
more reliably than we do) when we add by a distinct process. The 
machine adds the way puppets walk. The names are analogous. The 
machine attains enough reliability, stability, and economy of output 
to achieve realism without reality. A flight simulator has enough 
realism for flight training; you are really trained, but you were not 
really flying. 
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A decisive reason why a physical process cannot be determinate 
among incompossible abstract functions is “amplified grueness”’: a 
physical process, however short or long, however few or many out- 
puts, is compatible with counterfactually opposed predicates; even 
the entire cosmos is. Since such predicates can name functions from 
“input to output” for every change, any physical process is indeter- 
minate among opposed functions. This is like the projection of a 
curve from a finite sample of points: any choice has an incompatible 
competitor. 

We have no doubt that the processes in a mechanical adding ma- 
chine and in a personal computer are entirely physical. Addition 
cannot be identical with either of those physical processes because 
then it could not be done by the other. Suppose that addition is 
identical with a function among those processes. Then the processes 
would have to determine that function to the exclusion of every 
incompossible function. But they cannot do that, as the “quus,” 
“grue,” and “‘points-on-a-curve” examples show. So the machines 
cannot really add. 

Secondly, opposed functions that are infinite (that is, are a “‘con- 
version” of an infinity of inputs into an infinity of outputs) can have 
finite sequences, as large as you like, of coincident outputs; they can 
even have subsequences that are infinitely long and not different 
(e.g., functions that operate “the same” on even numbers but dif- 
ferently on odd numbers). So for a machine process to be fully 
determinate, every output for a function would have to occur. For 
an infinite function, that is impossible. The machine cannot physi- 
cally do everything it actually does and also do everything it might 
have done.!* That is the heart of the matter. The physical, as pro- 


13 There is a tary line of inquiry about immateriality. Christopher 
Cherniak argues (Minimal Rationality (Cambridge: MIT, 1986), p. 127) that 
because a physical object cannot be in an infinity of states, the mind treated as a 
brain computer is of limited understanding. That would be an understatement, 
were it true. Most of what actually happens would be unintelligible to us. An 
infinity of English sentences would be unintelligible, as would “most” truths of 
arithmetic. 


For even if each of the finite number of electrochemical states the brain 1s 
Kapabie of realig actually happened. ay, 10% different there would 
be an infmity of mathematical rems we could not even u nd because 
there would be no brain state or function among brain states to realize them. 

The popoi akan be trie Theni anonn Pati o PEDE TOTA Ene 
insofar as it has being, as Plato and Aristotle thought. And we are able to be 
in an infinity of states of understanding, not successively but qualitatively. That is, 
we have the active ability to understand anything (accidents of presentation and 
of intelligence quotient being ignored for now). Thus, there is no arithmetical 
theorem we cannot understand, accidents ignored for now. Nor are there any 
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ceas, is formally vague, no matter how far you extend it, or how 
minutely you describe its innermost mechanisms. The conclusion is 
that a physical process cannot realize an abstract function. It can at 
most simulate it. 

What Happened to Nature? Do natural processes, say, the be- 
havior of a freely falling body, not realize pure functions like 
“d = 1/2gt” and, where g = 32, “d = 16¢’? And is it not true that 
an object in empty space decreases in | in the direction it is 
traveling by an amount equal to V1 — v*/c? There are two reasons 
why such processes do not realize pure abstract functions of the 
sorts mentioned, only the second being relevant to the present dis- 
cussion. First, these laws apply by idealization. What is “the direc- 
tion” in which the object is traveling? There are no “point masses.” 
That is an idealization, as is its rest mass (say, for photons or neu- 
trinos, which are always moving at C). No object falling to earth is in 
a vacuum and under no gravitational attraction to other bodies. 
Physical phenomena often come close to our mathematizations 
which, of course, are invented to represent them. But those mathe- 
matizations are idealizations.'* That the laws are idealizations does 
not affect the present point. 

The kind of indeterminacy I am talking about is different from 
that. For the incompossible functions are equally idealizations, and 
may differ only logically because the ‘‘manifestation phenomena” lie 





ee ee 
would have to be vehicles for varying content, perhaps media for thought and not 
the same thing. 

Nothing is excluded because of its subject matter. Ours is not a successive 
infinite capacity (if we do not exist forever) but a selective infinite capacity. That is 
why the brain cannot even be the organ of thought, the way the eye is organ 


of sight, as Aristotle, Avicenna, Averroes, and many others argued; oth- 
erwise, there would be something (that t be actually) that is unintelligible. 
Our corporeality imposes accidental limitations on understanding, the most ım- 


portant of which is that our contents of judgment have to be made by demateriali- 
zation (abstraction) and our intelligence cannot directly access immaterial being 
(e.g., angels or God). Ra ena An eels 
(see note 17). 

How the dematerialization involved in our unders 
(without consideration of which thing it is, or of i pardeular mater 
tion) or our understanding something as in being P E E Ot ti 
being material) could even come about is totally beyond the resources of any 
known experimental or formal science. 

14 See Nancy Cartwright, How the Laws of Physics Lie (New York Oxford, 
1983), and Ta Hacking, Representing and Intervening (New York: Cambridge, 
1983). 
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beyond the actual (it being presupposed that all the actual phenom- 
ena accord with each function). So it is not a consequence of this 
account that there are no general and mathematizable laws of na- 
ture. Rather, just because there are general and mathematizable 
regularities, an object falling to the earth “in a vacuum” satisfies 
some incompossible function just as well as it satisfies “d = 1/2gt.” 
That is a consequence of the underdetermimation arguments. 

Now, to accept the overall argument, one does not need to deny 
that there are definite natural structures, like benzene rings, carbon 
crystals, or the structural (and behavior-explaining) molecular dif- 
ferences among procaine, novocaine, and cocaine. These are real 
structures realized in many things, but their descriptions include the 
sort of matter (atoms or molecules) as well as the “dynamic arrange- 
ment.” They are not pure functions.” i 

A musical score, say, Mahler’s 2nd Symphony, can be regarded as 
an analog computer that determines, from any given initial sound, 
the successive relative sounds and their relative lengths (within con- 
ventions of intervals and length), and thus is a function from initial 
sound onto successor sounds; yet, from the sounds (a performance) 
there is not a unique score determined among incompossible ones, 
except by convention. So, too, when we abstract the formal struc- 
ture, without matter, the physical thing (cel, molecule, gene, en- 
zyme) or process will satisfy a logically incompossible structure just 
as well. 

IO. RETREAT FROM PEOPLE 


So, to avoid the argument, someone will say: 

We do not really add, either, we just simulate addition. Pure addition is 
just as much an idealization as E = mc?. Of course, we can define such 
pure functions but cannot realize them; that is just a case of the many 
functions we can define which cannot be computed by any finite auto- 
mation, or any other computer either. In a word, the fact that there is 
no pure addition and no pure conjunction or modus ponens is no 
odder than the fact that there are no perfect triangles. 


We cannot really add, conjoin, or do modus ponens? Now that is 
expensive. In fact, the cost of saying we only simulate the pure 


15 General natures (e.g., structural steel) do “have” abstract forms, but are not 


“pure functions.” Two proteins, or ceils are the same, not by realizing 
the same abstract form, but by a structure “solid” with each individual (but not 


abstractions of those structures, many of which we can already formulate (cf. 
Scientific Tables (Basel: CIBA-GEIGY, 1970)). 
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functions is astronomical. For in order to maintain that the pro- 
cesses are basically material, the philosopher has to deny outright 
that we do the very things we had claimed all along that we do. Yet 
our doing these things is essential to the reliability of our reasoning. 
Moreover, we certainly can, Platonistically, define the ideal func- 
tions, otherwise we cannot say definitely what we cannot do. That 
exposes a contradiction in the denial that we can think in pure 
functions, however; for to define such a function is to think in a 
form that is not indeterminate among incompossible forms. To be- 
come convinced that I can only simulate the recognition that two 
Euclidean right triangles with equal sides are congruent, I have to 
judge negatively with all the determinateness that has just been de- 
nied. Each Platonistic definition of one of the processes, and each 
description of the content of logical or arithmetical judgment, is as 
definite a form of thought as any of the processes being denied; and 
each judgment that we do not do such and such a function is as 
definite in form as is conjunction, addition, or any of the judgments 
that are challenged; otherwise, what is denied would be indetermi- 
nate. It is implausible enough to say we do not really add or conjoin. 
It is beyond credibility to say we cannot definitely deny that we add, 
conjoin, assert the congruence of triangles, or define particular 
functions, like conjunction. 

The final and greatest cost of insisting that our judgments are not 
more determinate as to pure functions than physical processes can 
be, is that we can do nothing logical at all, and no pure mathematics 
either. Now, who believes that? 

There is not some parallel evidential indeterminacy between our 
activities and those of a machine whereby we cannot be sure what 
either is doing.! The machine cannot in principle add. We can be 
sure of that. And we can, and do add, and conjoin and reason 
syllogistically. We can be sure of that, too. 

Someone rejoins, “So you say. But we might be just simulating.” 
The rejoinder defeats itself. By its presumption, it grants the force 
of the argument as a whole, that there are pure functions and that, 
if certain thought processes were physical processes or functions 
among them, they would not be formally determinate. It merely 
asserts as a counterpossibility that I may think I am adding, etc., 
when I am only simulating a pure function. But to think I am adding 


16I think Kripke (op. cit., pee 21, 65, and 71) interprets what he regards as 


indeterminacy as to whether meant plus or quus as the basis for an 
indeterminacy about what I do. (“There 1s no fact of the matter. ) I say gets 
nclusions. 


the explanatory order backward and invites mistaken co 
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or conjoining, with a clear idea of what that is, is to perform a pure 
function in that very thought, whether it is true or not. 

Besides, such counterpossibilities require an ontological status for 
the pure functions simulated. We think of them and even define 
them. If that is so, then the thoughts and definitions cannot be 
indeterminate among incompatible functions because no definite 
function would then be defined by such thinking. So those function- 
determining thoughts cannot themselves be just simulations but 
have to realize pure functions, e.g., “defining addition,” “‘conceiv- 
ing modus ponens.” Hence, in order to be mistaken in a certain way, 
I have to think in exactly the way that cannot be entirely physically 
Tealized. 

To say we may not know whether we are adding, when we are, or 
squaring, when we are, is actually to grant that we might perform 
the determinate thought function that cannot be wholly physical, 
and thus to grant the whole argument. Similarly, to say, ““We do not 
know whether we ever perform a formally determinate function,” is 
to say either (a) we are in a cognitive state, “uncertainty as to 
whether we are really adding, squaring, or conjoining,” although we 
do not experience uncertainty, when we produce sums, squares, and 
simple arguments; or (b) we are always mistaken when we are certain 
we are adding, conjoining, etc., because at most we simulate. 

Now, the first option also concedes the main argument because it 
postulates uncertainty when we actually do add, etc. The second 
postulates mistakes about what we are doing, and thus concedes the 
main argument, too: that there are such definite functions for which 
the only locus must be in thought. Any other answer will leave the 
pure function without any logical space (locus). When we are certain 
we are adding, we are always wrong. But that reasoning will hold for 
whatever we do. Thus, we are always wrong about what we think we 
are doing, when we think we are doing something definite enough to 
be a pure function. To suppose we can think definitely enough 
about functions to be wrong about what we are doing concedes the 
supposition of the argument again. Now the doubt has spread to 
include every pure function: asserting, questioning, objecting, stat- 
ing, reporting, as well as adding, squaring, and conjoining. The 
doubt has even spread to include the very repeating of what I take, 
mistakenly, to be my argument and to make it indefinite whether 
you are actually denying or disputing my conclusion. Moreover, the 
cost extends to particular pure functions, specified by content: ‘‘add- 
ing three and three,” “judging that Greeks are courageous,” 
“doubting whether philosophy is scientific,” “reading a paper,” 
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“thinking this writer is mad.” Such an epidemic of doubt, without 
any effect on one’s own certainty, must involve a mistake. 

If we are always only simulating when we think we are doing 
something formally definite, then it is never determinate what we 
are doing at all. That requires that we are never doing such definite 
things at all. That is expensive, because there is no place for logic or 
mathematics or any other formal thinking at all; we cannot even 
“castle” in chess, but only “simulate” it, without any explanation for 
what “it” is or what its status is ontologically. Saint Augustine simi- 
larly objected to a “verisimilitude”’ account of truth in Contra Aca- 
demicos. The relation of simulation will not be definable without the 
prior notion of pure functions. 

If we can agree that either (1) we do have such definite thought 
processes as I described, cases of conjunction, determinate among 
all incompatible functions, and that they cannot wholly be physical 
processes (or functions among physical processes only), or (2) we 
never perform such processes but at most simulate them, then I am 
content. For I shall then wait for the counterattack to support (2), 
the one that explains the status of all those functions I cannot really 
perform and only think I can define (for to define one is to perform 
another one), and, in particular, explains the success of mathematics 
and pure logic, especially natural deduction systems and the proofs 
of completeness of propositional calculus, and offers a worked-out 
contrast between adding (which no one, apparently, can do) and 


simulating adding. 
IV. FUNCTIONAL STATES 


Kripke seemed to realize that functionalism would fail because “any 
concrete physical object can be viewed as an imperfect realization of 
many machine programs” (op. cit., pp. 36-7, n24). But it looks to 
me as if he was about to draw the wrong conclusion, when he said 
“taking a human organism as a concrete object, what is to tell us 
WHICH program he should be regarded as instantiating? In particu- 
lar, does he compute ‘plus’ or ‘quus’?”’ He should have concluded 
that, if a human is only a “concrete physical object,” then nothing 
determines, at a certain level of refinement, which program it in- 
stantiates because it instantiates none; whereas humans do add, de- 
fine, and so forth, and are thus not just concrete physical objects. 

If a “thought process,” say, adding, were a function linking actual 
physical states to “subsequent” physical states, then whatever the 
pattern of mputs to outputs, there are incompossible functions that 
link the states equally well. In that case, we could not really add. Nor 
could we deny that we add precisely. Since we can add, we know our 
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thought process is not the same as any function among brain states 
because no such function is determined (the way two points deter- 
mine a line) by physical states. 

The very step toward generality to escape the inconveniences of 
identifying an abstract process with a particular physical process, 
say, mechanical addition (with the inconvenience that there could be 
no electronic addition), creates the situation where incompossible 
general functions equally well “explain” the succession of physico- 
cognitive states, and thus discloses that no one function is realized to 
the exclusion of all the others at the physical level, and thus no pure 
function is realized at all. That guarantees that functions among 
physical states (in a process) are not the thought states because there 
are no determinate functions realized among physical states, when 
the form of thought is determinate. No real process of adding is 
identical with any process that equally well realizes an incompossible 
function. Consequently, ‘‘adding”’ is not a physical process or func- 
tion among physical states either. Besides, the functors in such func- 
tions are not physical either. For, of course, it is numbers we add, 


not numerals. 
V. ALL THOUGHT IS ABSTRACT 


The main argument is that some thought is determinate, among 
incompossible functions, the way no physical process, series of pro- 
cesses, or physically determined function among processes can be. 
The result is that such thought is never identical with any physical 
process or function. (Nor can it really be such a physical process or 
function either, though it may, for all we have said, have a material 
medium, like speech.) 

The full generalization that all thought is determinate that way is 
harder to make cogent, because it rests on one’s recognizing that, 
whatever thinking we do, whether simple assertion or hoping or 
wanting or intending (over the whole family of things each of those 
can be, according to its particular content on a particular occasion) 
is such that, in order to do that, we have to do what is the same for 
an infinity of other cases (sorted by content) that do not happen. 
For someone else might have thought or said or believed or felt the 
same in a way definite among incompossibles. So, any thinking at all 
is of general “form,” just as is adding, conjoining, reasoning validly, 
and squaring. 

By its nature, thinking has “other cases” and is therefore always 
of a definite form (which may not be articulable by us, as are mathe- 
matical and logical forms). Asserting (in any one of its senses) cannot 
be “halfway” between opposed forms; it would not be asserting 
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then. And so on, for every form of thinking. But no physical process 
or sequence of processes or function among processes can be defi- 
nite enough to realize (“pick out”) just one, uniquely, among in- 
compossible forms. Thus, no such process can be such thinking. 

The conclusion is that no physical process or sequence of pro- 
cesses or function among physical processes can be adding, squar- 
ing, asserting, or any other thinking at all.!” 


JAMES ROSS 
University of Pennsylvania 


17 All thought, as content, is immaterial in two other ways. (1) It lacks the 
ioe On uy Door” sequen foc Pa E oy. lAn opt es 


of features not contained (or implied) in the true t 
the same statement true. (2) ies a requires its truth-making 
reality to overflow the thought detail of what obtains. So every 
compliant reality is infinitely more definite then anything contingently true we 
can say about it. It takes a lakeful of reality for one drop of truth. 

A second argument: Products of processes are transcendently deter- 
minate. But no product of the ing has an infinity of content, not 
contained therein logically. So no physical can ever be such a content of 
the 2 


Some thinking PE ese fe ear aioe alan ee oa 
is as much a mode of t as it is a mode of movement; yet no movement, 
however complex, could ever make a th 

Leibniz says in section 17 of the Monadology (in Philosophical Pa and 
Letters, Leroy Loemker, ed. and trans. 2nd ed. (Dordrecht: Reidel, 1969), p. 644) 
that, if crane eee ee see produced by a machine, we ‘make 

te machine On large seale sn walk round in ke 4 mill we woul never fnd a 
perception, the movements of wheels, gears, and pulleys. Similar reasoning is 
Been Tani Conversation of Philarete and Ariste (Loemker, p. 623). I 
thank Margaret Wilson for pointing these passages out to me. 

A third argument: The cases concern the definiteness of the form of 
the thinking. A third, can be constructed from the definiteness 
of the content of thought, that t is definite PPRA AOPA EEREN 
in a way no physical proces can ever be. Similar underdetermination argumen 


do not process numbers (though we do); they process representations 
(signals). e a e ar numbers, machines do not 
add. And so on for statements, m themes, novels, plays, and arguments. 
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Leibniz’s Metaphysics: A Historical and Comparative Study. 
CATHERINE WILSON. x + 350 p. Princeton: Princeton University 
Press, 1989. Cloth $39.50. 
Leibniz and Arnauld: A Commentary on their Correspondence. 
ROBERT C. SLEIGH, JR. xvi + 237 p. New Haven: Yale University 
Press, 1990. Cloth $28.50.* 


In the last ten or fifteen years, we have seen an explosion of interest 
in the history of philosophy in the seventeenth and eighteenth centu- 
ries among Anglo-American philosophers. We have also seen a re- 
markable increase in the standards of scholarship in the field. It was 
not so long ago that Descartes could be read as if he were an Oxford 
philosopher who wrote in the quaint and slightly archaic English of 
Haldane and Ross, and secondary literature not in English could be 
dismissed, as if clear-headed philosophical thought were not possi- 
ble outside of English. Things are very different now. Responsible 
scholars must now take into account both the best available original- 
language texts, and the full range of secondary literature in French 
and German, at very least. Furthermore, work in the history of philo- 
sophy is becoming more and more historical. The scholar is ex- 
pected to know not only the central philosophical texts that a 
Descartes or a Leibniz may have written, but the minor texts as well, 
texts that previously would have been dismissed as nonphilosophical 
and of antiquarian interest only. In addition, the well-rounded 
scholar is expected to know the more general intellectual and cul- 
tural background, what was going on in less visible -figures like 
Pierre Gassendi, Sir Kenelm Digby, and Erhard Weigel, as well as 
parallel developments in areas like the history of science and the 
history of theology. Along with these changes in the field have come 
very healthy debates about how the history of philosophy is to be 
done, and what the philosophical interest of the subject is. 

The new climate for the history of philosophy is very evident in 
these two new books, Leibniz’s Metaphysics by Catherine Wilson 
and Leibniz and Arnauld by Robert C. Sleigh. Leibniz is a particu- 
larly difficult character to write about, in good part because he left 
no single text that can be regarded as definitive; the scholar of his 
thought must be conversant with a wide range of essays, letters, and 


* I would like to thank Donald Rutherford, Ariew, and Brandon Look 
for helpful comments they made on an earlier of this review. 
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notes that span more than half a century, from the early 1660s to his 
death in 1716. This is made particularly difficult by the fact that 
much of the writing remains unpublished and much that is pub- 
lished is available only in inadequate nineteenth-century editions. 
Both authors show a deep acquaintance with the corpus, and both 
have at their fingertips the latest word in textual scholarship; in- 
deed, Sleigh’s monograph makes excellent use of some yet-unpub- 
lished manuscript work on the correspondence with Arnauld. Both 
also make good use of the intellectual background to Leibniz’s 
thought, and the best of contemporary European scholarship. Fur- 
thermore, both authors are sensitive to the problem of development 
in Leibniz’s thought. Often commentators approach Leibniz with 
the assumption that for any issue there is a single Leibnizian posi- 
tion, at least in his mature writings, and it is the job of the commen- 
tator to find it out. In reality, though, Leibniz’s thought underwent 
a number of important changes throughout his long philosophical 
career; though he had his preoccupations and his commitments, he 
showed an admirable ability to try new things out, rethink old posi- 
tions, look at his thought from different points of view, and change 
in the face of criticism. Both Wilson and Sleigh show a real willing- 
ness to follow Leibniz in his philosophical meanderings, and try to 
tease out his train of thought, rather than impose some one doctrine 
on him, willing or not. 

Of the two books, Wilson’s is the broader. The book is intended as 
a general investigation into Leibniz’s metaphysics, “an examination 
of Leibniz’s system understood [not] as a collection of theses and 
principles, but developmentally and thematically” (2). More particu- 
larly, she indicates that her main interest is the problem of the con- 
tinuum, the problem as to how apparently continuous extended 
things in the world could be made up out of more fundamental 
parts. The question of historiographical methodology is also of par- 
ticular concern to Wilson. Understanding Leibniz’s development is 
central to her project, as is understanding Leibniz in his intellectual 
context. “What a philosopher means to say,” she writes, “can only 
be grasped in light of the choices actually open to him, and these are 
defined by his relation to past and present authors” (2). Wilson’s 
‘interest in development and context is evident, as is her special 
interest in the development of the metaphysics of the Monadology 
out of the somewhat different system(s) of the Discourse on Meta- 
phystcs. But the book does not really have the kind of unity that the 
author promises; it is, in reality, a collection of studies of various 
issues that relate to Leibniz’s metaphysics. Each chapter is a rela- 
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tively self-contained essay in an important topic in Leibniz’s meta- 
physics which brings in both the full range of Leibniz’s texts and 
views, and those of his contemporaries. The topics she discusses 
include Leibniz’s interest in universal languages and encyclopedias, 
his thought in the 1670s before his mature period, the metaphysical 
systems of the Discourse on Metaphysics, his interest in the meta- 
physical foundations of physics, the development of the concept of 
the monad from his earlier thought, and his conceptions of space, 
motion, and gravity. In addition, Wilson also has chapters that focus 
on metaphysical issues in Leibniz’s two philosophical books, the 
New Essays Concerning Human Understanding and the Theodicy. 
She ends with a very interesting and novel discussion on “the diffi- 
cult problem of reading Leibniz in a post-Kantian era” (306), that is, 
how we can take Leibniz seriously after Kant’s critique of dogmatic 
metaphysics. 

There are many riches in Wilson’s book. But what I would like to 
concentrate on is what she announces as her main theme, the devel- 
opment of Leibniz’s thought. 

Although development is her supposed theme, there is little to be 
found of it in her opening chapters, an account of Leibniz’s earliest 
work on logic, language, and the project of an encyclopedia (chapter 
I), followed by an account of his metaphysics and natural philosophy 
in the years that precede the Discourse on Metaphysics, what she 
calls Leibniz’s ‘‘first philosophy” (chapter II). Wilson’s exposition 
here is particularly useful, given how little material about these early 
years is generally available in English. She is particularly good at 
setting the early thought about language and logic into its proper 
intellectual context, bringing in both familiar names like Hobbes 
and Locke, and less familiar views of Kircher and Wilkins. She also 
takes note of the important early works on physics, the Theory of 
Abstract Motion and the Theory of Concrete Motion, and Leibniz’s 
so-called Paris Notes. But even though much of this early material 
provides an obvious contrast with Leibniz’s later views (as well as 
obvious anticipations of his mature doctrines), Wilson has little to 
say about how Leibniz’s thought developed in this early period, and 
how it may have moved from those early views to the later thought.’ 

Wilson’s interest in Leibniz’s development begins in earnest only 


1 On this see, ¢.g., Yvon Belaval, “Premières animadversions de Leibniz sur les 
Pray de Descartes,” in Mélanges Alexandre Koyré v. I: L’aventure de 
pint aris Hermann, 1964); Fichant, “Neue Einblicke in Leibniz’ 

rm seiner Dynamik (1678),” Studia Letbnitiana, xx (1990): 48—68; and 
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in the mid 1680s with the composition of the Discourse on Meta- 
physics, an important, though neither final nor entirely coherent 
statement of Leibniz’s first mature thought. It is in connection with 
the Discourse on Metaphysics that Wilson makes one of her most 
interesting claims: 


To the reader who approaches it free of presuppositions, the treatise 
reveals not a single system, but three separate schemes or semi-systems, 
A, B, and C, seated in the general context of a theory of the nature and 
perfections of God. Metaphysics A is the theory of individual sub- 
stance, based... upon Leibniz’s logic of descriptions. Metaphysics B is 
the theory of bodies, or corporeal substances, scholastically inspired 
but given an extra dimension by Leibniz’s theory of the vis viva. Meta- 
physics C, finally, the theory of the harmony of perceptions and ac- 
tions, has strong affinities to the system of Malebranche (80).” 


Wilson argues that these three “‘semisystems’’ are in tension with 
respect to one another, and do not fit together to form a single 
coherent system. 

Although she does not put it in exactly this way, the story Wilson 
tells of Leibniz’s development in the years that follow seems to be a 
story of how these different strands develop, and how the balance 
among them changes. 

The next major stage Wilson sees in Leibniz’s development occurs 
when he turns his attention more and more toward physics and its 
foundations. She writes: 


During the late 1680s, as Leibniz grew increasingly preoc- 
cupied with the metaphysical significance of empirical force-concepts, 
the basis of his metaphysics began to change from logic to natural 
philosophy (156). 


Wilson’s claim is that in working out his physics in the late 1680s, 
Leibniz came to emphasize more and more what she calls metaphys- 


esp. André Robinet, Architectonique pee rE automates systémiques et 
id kalité transcendentale dans l’oeuvre de G. Leibniz (Paris: Vrin, 1986). 


Leibniz’s Metaphysics (New York: Cabot forthcoming), which will focus on 
the development of the Leibnizian conception of substance from the student 
work of the 1660s to the late 1670s. 

? Although Wilson is interested in the various threads of Leibniz’s thinking in 
the Discourse, she does not make use of Henri Lestienne’s widely available edi- 
tion of that work, which presents the complete variants found in earlier versions 
of that work. 
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ics B from the Discourse on Metaphysics, a metaphysics that takes as 
basic corporeal substance, understood as a unity of active and pas- 
sive forces, which were to be identified with the form and matter of 
the schoolmen. 

The final important change Wilson recognizes occurs in the New 
System of 1695, where, according to her, Leibniz substitutes the 
notion of the monad for that of the individual substance (metaphys- 
ics A) or corporeal substance (metaphysics B) as the basis of his 
metaphysics. Wilson argues that the monad must be carefully distin- 
guished from the individual substance of metaphysics A of Leibniz’s 
Discourse. Individual substances, as Wilson understands them, are 
the logical subjects of metaphysics A, those things capable of having 
a complete individual concept. Wilson claims that “the theory of 
individual substance has been dropped from the New System. Leib- 
niz now comes to lean on metaphysics C. . . ” (159). This is what 
seems to win out in the end, Wilson reports. Metaphysics A, 
dropped by the mid-1690s, never returns; metaphysics B, Wilson 
claims, was “destined for the scrap heap of ‘popular notions’ ” 
(181). Metaphysics C, the theory of mind-like monads characterized 
by their perceptions and appetitions, the theory first unambiguously 
endorsed in the New System, is the system that predominates until 
the end, it seems. 

Wilson’s version of the development of Leibniz’s metaphysics is 
very illuminating, and it is very healthy to see such bold interpreta- 
tive hypotheses stated and defended. Wilson’s Leibniz is definitely 
not the static character found in much of the older literature. But, 
at the same time, I do not think that the story she tells is quite the 
right one. 

First of all, what Wilson identifies as metaphysics A in the Dis- 
course on Metaphysics is, I think, a bit broader than the way she 
characterizes it, and persists longer than she allows. As she charac- 
terizes it, the focus of metaphysics A is the individual substance, 
characterized as Leibniz does in section 8 of the Discourse on Meta- 
physics, in terms of the predicate-m-notion principle and the com- 
plete individual concept. That is certainly important to Leibniz in 
the period, and it certainly is fair to say that the logical conception 
of this individual substance does seem to drop out. But behind the 
particular characterization of individuality in the Discourse there is 
a much more general concern with individuality and the closely con- 
nected notion of unity in the writings of the period. The later letters 
in Leibniz’s correspondence with Arnauld are dominated by a con- 
cern with the importance of there being genuine individuals in the 
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world. The argument that comes up over and over again is what 
might be called the aggregate argument: if there are to be com 
plexes (aggregates) in the world (which bodies must be because of 
their divisibility), then there must be genuine individuals.’ This ar- 
gument persists throughout Leibniz’s career, and can be seen as late 
as the opening sections of the Monadology and Principles of Na- 
ture and Grace. 

A second reservation I have about Wilson’s story concerns an 
aspect of Leibniz’s thought in the Discourse on Metaphysics and 
beyond (not to mention before) which gets inadequate attention in 
Wilson’s presentation of the main lines and evolution of Leibniz’s 
thought. The denial of occasionalism, the doctrine that God is the 
only genuine cause in the world and the assertion of a world of 
genuinely active individuals, is one of Leibniz’s most important 
themes. It is clearly present in the Discourse on Metaphysics. The 
infamous section 8 begins by noting that “it is rather difficult to 
distinguish the actions of God from those of his creatures,’* and 
claiming that the discussion that follows will clear this matter up; 
indeed, the logical theory of substance in the sections that follow 
can plausibly be read as an attempt not only to outline a logical 
notion of individuality (Wilson’s metaphysics A), but to put forward 
a refutation of occasionalism based on undeniable logical principles. 
His point is that, if every individual substance has a complete individ- 
ual concept, a concept that includes everything true of the individ- 
ual, past, present, and future, then there must be something in that 
individual substance by virtue of which the complete individual con- 
cept pertains to it, an individual nature that answers to the complete 
individual concept. And if so, then we do not have to look outside of 
the individual (even to God) to explain any of its accidents. The 
refutation of occasionalism is also a central theme in Leibniz’s 
correspondence with Arnauld, in the “Specimen Dynamicum’’ of 


3 For an account of the argument, see Daniel Garber, “Leibniz and 
the Foundations of Physics: Middle Years,” in K. Okruhlik and J. Brown eds., 
The Natural Ei of Leibniz (Dordrecht: Reidel, 1985), $1. This argument 
is connected with Lefbniz’s mature to the of the compo- 
sition of the continuum. Although as a theme m the beginning of the 
book, and alth Wilson gives a serious treatment of Leibaia’s early thoughts 
on the problem ; 74-8), Leibniz’s mature race CTS: ie ice aca 
prominent in Wilson’s tion (see 3, 102-3, 18 

“C. I. Gerhardt, ed., G. W. Leibniz: Dis philosophischen Schriften (Berlin: 
Weidmannsche Buchhandlung, 1875—90), vol. 4, p. 432, in R. Ariew and D. 
Garber, eds. and trans., Leibniz: Philosophical Essays (Indianapolis: Hackett, 
1989), p. 40. 
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1695, in the “On Nature Itself’ of 1698, and in numerous other 
writings.” 

The notion of activity and the refutation of occasionalism is every 
bit as important as the themes that Wilson chooses to emphasize in 
the Discourse on Metaphysics and beyond. Now, Wilson does not 
ignore the issue of occasionalism altogether (see, e.g., 111). But 
because of a certain confusion in her treatment, it does not get the 
prominence that it deserves. Wilson closely connects the issue of 
God’s continual recreation of the world from moment to moment, 
and the doctrine of occasionalism (see 148ff; 169ff.). And indeed, 
there is a strong temptation to do so. For many of the later Carte- 
sians (though probably not for Descartes himself), the doctrine of 
God’s sustenance of the world through continual recreation is the 
ground of their view that God is the only genuine cause in the 
world.® But Leibniz’s view is different. For Leibniz, God must create 
the world, and must then sustain the world that he created. But what 
he creates and sustains is a world of active individuals. Once we 
separate the issues, then Leibniz’s position becomes much clearer. A 
resolute foe of occasionalism from his earliest writings,’ his project 
is figuring out how to maintain this without rejecting the doctrine of 
divine substance from moment to moment. But because Wilson does 
not carefully separate the issue of divine sustenance from 
occasionalism, she takes Leibniz’s commitment to divine sustenance 
as evidence of his commitment to occasionalism. Her view seems to 
be that Leibniz does not really reject occasionalism in the Discourse 
on Metaphysics, and argues that even in later writings (as late as the 
Monadology in her view) Leibniz is not sure on that issue, a position 
that cannot be right (see 169-70). 

If we read metaphysics A in an appropriately broad way, and add 
activity to Wilson’s three strands of Leibniz’s metaphysics, then we 
have, I think, a reasonably good list of Leibniz’s main metaphysical 


* In the correspondence with Arnauld, see, e. » Gerhardt, vol. 2, pp. 92-4, in 
Ariew and Garber, pp. 82—4; in the “Specimen see Gerhardt, ed., 
G. W. Leibniz: Mathematische Schriften (Berlin: Asher & Halle: le: Schmitt, 1849- 
63), vol. 6, p. 242, in Ariew and Garber, p. 125; in “On Nature Itself,” see $§ 
8-10, Gerhardt, vol. 4, pp. 508-10, in Ariew and Garber, pp. 159-61. 

t See Garber, “How Causes Motion: Descartes, Divine Sustenance, and 
Occasionalism,” this JOURNAL, LXXXIV, 10 (October 1987): 567-80. - 

7 For Leibniz’s first clear rejection of occasionalism, see Garber, “Motion and 
Metaphysics in the Young niz,” in M. Hooker, ed., Lerbniz: Critical and 
Interpretative Essays (Minneapolis: Minnesota, UP, 1982), esp. pp. 166-7, 171- 
2. This is also discussed in Mercer’s forthcoming monograph. 
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preoccupations. But I think that it is not entirely accurate to see 
these themes simply as competing semisystems, and to see Leibniz’s 
development as simply a matter of choosing among them. Wilson is 
not altogether wrong. Leibniz’s metaphysics is grounded in the idea 
that there are certain metaphysically basic substances that are active 
and genuine unities, and at different times he does hold different 
conceptions of what those active individuals are. On one model, the 
metaphysically basic active individuals are corporeal substances, 
thought of on analogy with animals (Wilson’s metaphysics B); on 
another, they are conceived of on analogy with Cartesian souls (Wil- 
son’s metaphysics C). (There may be others too.) Hence the appar- 
ent conflict between metaphysics B and metaphysics C which Wilson 
notes. Although both models of what is metaphysically basic remain 
live possibilities for much of his life, there is a general progression in 
Leibniz’s thought from the first (most prominent in the letters to 
Arnauld in the late 1680s) to the second (most prominent in the 
Monadology). But even when Leibniz chooses to ground his meta- 
physics in mind-like monads, corporeal substance does not necessar- 
ily drop out; to this extent, metaphysics B and C are not really 
competitors. Although they may not be metaphysically basic (as they 
seem to be in the letters to Arnauld), corporeal substances are often 
present in Leibniz’s later philosophy; this, indeed, is a central prob- 
lem in the later part of Leibniz’s correspondence with Des Bosses, 
how corporeal substance could be possible in a world of monads. To 
make sense of Leibniz’s thought, to appreciate both what remains 
constant and what changes, I think that we must distinguish between 
at least two levels in Leibniz’s metaphysics. There is, first of all, what 
we may call the abstract metaphysics, basic, structural principles. 
Here I would include the commitment to a world of metaphysically 
basic genuine individuals, individuals that are genuinely active and 
ground the rest of the contents of Leibniz’s world. It is here that we 
find the emphasis on individuality characteristic of Wilson’s meta- 
physics A, which is not in competition with her metaphysics B or C, 
but grounds them. This abstract metaphysics seems to change little 
in its broad outlines in his mature thought. But that settled, there is 
a great deal that is not. What remains is what might be called the 
concrete metaphysics, the account of what exactly it is that is meta- 
physically basic (are they mind-like or animal-like or something else 
altogether?), and how exactly the metaphysically basic grounds the 
rest of the furniture in Leibniz’s world. 

Wilson’s book is a real contribution to the English-language litera- 
ture on Leibniz. But, nevertheless, it contains a number of straight- 
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forward mistakes. Mersenne’s Harmonie universelle is 1636-37, 
not 1619 (23); Leibniz’s “Brief Demonstration” of 1686 is twice 
given as 1684 (129, 138); the reference to Leibniz’s view that God is 
a concurrent mind that gives nonhuman substances their reality is 
surely not to anything in 1678, as Wilson claims (183), but to writ- 
ings that date from 1668.° In one other place, Wilson uses a very 
early document, the Propositions quaedam physicae of 1672 to 
support a claim about Leibniz’s mature views of space, attributing to 
Leibniz a doctrine unlikely for him to have held in his later thought 
(134). In another, she identifies the quantity conserved in 
Descartes’s universe as “weight x speed” (142), rather than size 
times speed.’ 

Although Sleigh is more interested in the philosophical analysis of 
Leibniz’s thought than Wilson is, his book shares the ambition of 
explicating Leibniz’s views in their proper historical context. But 
rather than take the long view, Sleigh concentrates on a very few 
texts; while he has an evident command of the breadth of Leibniz’s 
writings, the focus is on the Discourse on Metaphysics and the im- 
portant correspondence with Arnauld. Sleigh attempts to present a 
picture of what the world looked like to Leibniz at that crucial mo- 
ment in his development, in 1686, what Leibniz himself identified as 
the beginning of his mature thought. The result is an exemplary 
piece of scholarship, historically sound and philosophically illuminat- 
ing, a model study of how to treat the elusive thought of a character 
like Leibniz. 

After some methodological preliminaries, Sleigh offers some use- 
ful background on two figures essential to understanding the histori- 
cal place of the writings he is studying. There is a very informative 
chapter on “the Great Arnauld,” Antoine Arnauld, the recipient of 
Leibniz’s letters, and the author of some very penetrating comments 
on Leibniz’s ideas (chapter 3). Sleigh there sketches out Arnauld’s 
full career and outlines his views on key philosophical and theologi- 
cal issues relevant to his exchange with Leibniz. In addition, Sleigh 
has a short chapter (chapter 2) on the intermediary in the corre- 
spondence, Ernst, Landgrave of Hessen-Rheinfels, a Catholic no- 


$ The reference uw, , to the sort of account Leibniz in the “De 
Transubstantione”’ o 1668, in the Akademie edition, G. W. Tbe saat 
Schriften und Briefe (Berlin: Akademie, 1923-), series 6, vol. 1, pp. 508-9, in 
Loemker, ed. and trans., Leibniz: Philosophical Papers a (Dor- 


drecht: Reidel, 1969), p. 116. 

’ Weight, of course, gravity (beaviness), which for Descartes was a 
deira oysa] masine, one thar bodies Kave odiy because-oF the fuid 
vortex that happens to surround the earth. 
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bleman sympathetic to Leibniz. (It was also no small matter that 
Ernst happened to know how to get in touch with the Great Ar- 
nauld.) 

But the heart of the book is a series of four chapters on Leibniz’s 
philosophical thought in the period. In chapter 4, Sleigh discusses 
the issues of freedom and contingency in the texts, and the related 
issue as to whether all properties an individual has are essential to it 
(the so-called doctrine of superessentialism). The issues here are 
familiar ones, some of the most widely discussed of Leibniz’s views, 
at least in the English-language literature. Sleigh’s discussion of 
them is very skillful, showing his mastery of the texts, the contempo- 
rary literature, and the philosophical issues. And despite the famil- 
iarity of the issues in recent discussions, Sleigh manages to make 
some surprising and original claims, arguing against superessential- 
ism and for a weaker doctrine he calls superintrinsicalness. In 
chapter 5, Sleigh takes up the question of substance in the late 
1680s, showing the variety of views about what the metaphysically 
basic substances are among which Leibniz was attempting to decide, 
whether such basic substances are to be understood on the model of 
a soul or mind, or on the model of some sort of organism, a question 
to which we shall return. Chapter 6 is entitled ‘‘Substantial Forms.” 
While the notion of a substantial form is certainly relevant to the 
discussion there, the main focus is on the question of the unity of 
substance for Leibniz, how it is that the various candidates for sub- 
stancehood that Sleigh discusses in chapter 5 constitute genuine 
individuals and unities. Chapter 7 (‘‘Action’’) is a lovely account of 
Leibniz’s views on the activity of substance. Sleigh is very good here 
at setting out the different views of Leibniz, Arnauld, and Male- 
branche (whose views are behind so much of the text of the Dis- 
course) on the quegtion of occasionalism and the action of God and 
his creatures. (Sandwiched in here is also an elegant, though brief, 
discussion of expression, accommodation, and solipsism in Leibniz 
(170—82).) The book ends with a brief epilogue (chapter 8, “Con- 
cluding Remarks”). Among the appendices is a translation of a draft 
of the short letter of 14 July 1686 (one of two letters written on that 
date), appearing in print for the first time in any language. It is 
worth remarking that Sleigh uses.the very latest in scholarly tools. 
For the Discourse on Metaphysics, he uses the edition of Lestienne, 
containing all of the variants from all extant versions of the text.’ 

10 Unfo , the Academy edition published its own preliminary version of 
that text in its Vorausedition only after Sleigh’s manuscript was complete; it 
appears in the Voraused:tion, vol. 8 (1989), pp. 1687-1739. 
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For the correspondence with Arnauld he makes use of the edition 
that Geneviève Rodis-Lewis published of the letters that Arnauld 
actually received,’ as well as the new edition of the Hanover manu- 
scripts in preparation by Reinhard Finster, based on a completely 
new transcription of the manuscripts and containing a significant 
amount of philosophically important material not currently avail- 
able in published form. 

There is so much to discuss in Sleigh’s clear and elegant book. But 
it would be interesting to discuss one theme that Sleigh’s book has in 
common with Wilson’s, Leibniz’s views on substance, body, and the 
ultimate make-up of the world in the period in question. 

Sleigh begins by distinguishing four different accounts of what 
the metaphysically basic substances in the world are like, what he 
calls the spiritual theory, the corporeal-substance theory, the modi- 
fied-corporeal-substance theory, and the monadological theory (96). 
On the spiritual theory, all there are in the world are spirits, that is, 
conscious minds (ours, God’s, and those of angels); bodies on this 
view are treated phenomenalistically, as bundles of (conscious) per- 
ceptions (98). On the corporeal-substance theory, the metaphysi- 
cally basic elements of the world are corporeal substances, unities of 
form and matter on analogy with organisms, where form and matter 
are themselves understood not to be substances capable of existing 
apart from the composite (98-9). The modified-corporeal-substance 
view is just like the straight view, except for the fact that the form 
(soul) is itself a substance, capable of independent existence, even if 
it never actually exists without a body (99-100). On either view, the 
basic model for the individual is an animal, whose soul is a form, and 
body, matter. Finally, on the monadological theory, the metaphysi- 
cally basic things of the world are monads, unextended, simple, 
soul-like entities, only some of which are conscious (100-1). 

Like Wilson, Sleigh sees the Discourse on Metaphysics and the 
subsequent correspondence with Arnauld as a combination of dif- 
ferent threads, not surprisingly, the four threads enumerated above. 
At the surface, most visibly in the correspondence with Arnauld, is 
the corporeal-substance theory, particularly in its modified form, in 
which the form (soul) is regarded as something of a substance (109). 
But often lurking beneath the surface of the Discourse and corre- 
spondence with Arnauld, in somewhat elliptical passages and in pas- 
sages that Leibniz deleted from his final drafts, as sent to Arnauld, 


11 Genevieve (Rodis-) Lewis, cd., Lettres de Leibnix à Arnauld d'après un 
manuscrit inédit (Paris: Presses Universitaires de France, 1952). 
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Sleigh sees evidence of both the spiritual theory and the monado- 
logical view. One feature that Sleigh calls our attention to is the 
numerous times, both in deleted passages and in passages which 
Leibniz.allowed to stand, in which he treats the reality of body hypo- 
thetically: if bodies are real, if bodies are not mere phenomena like 
the rainbow, then the world must contain substances that have such- 
and-such properties. Sleigh takes Leibniz’s apparent agnosticism 
about body to be a sign that he is attempting to turn Arnauld’s 
attention to the spiritual or monadological theory (103, 104; cf. 
105). He does acknowledge that the corporeal-substance view, in 
one or another of its variants, does seem to win out, particularly in 
the correspondence with Arnauld (109); in this respect he seems to 
differ from Wilson, for whom the corporeal-substance view does not 
predominate until later in the decade, as Leibniz turns his attention 
to physics. But despite the visibility of the corporeal-substance view, 
he does not think that it is Leibniz’s real view in these texts. Sleigh is 
convinced by certain arguments that Leibniz offered against the 
reality of motion and extension in these and related texts that the 
corporeal-substance view cannot represent Leibniz’s real view; for if 
there are really corporeal substances, be they basic or not, they 
would have to be extended, and if they are extended, then, Sleigh 
argues, that would cause serious problems on Leibniz’s view, prob- 
lems Leibniz explicitly recognized (112ff). Sleigh’s view seems to be 
that Leibniz presses the notion of corporeal substance in his corre- 
spondence with Arnauld not because he believes it, but only to show 
that his philosophy is suitable for grounding Catholic theology, 
making it more suitable than Descartes’s view for unifying the Cath- 
olic and Protestant churches, a project dear to Leibniz’s heart. 
Sleigh notes that Leibniz calls Arnauld’s attention to the official 
Catholic dogma that the soul is the form of the human body, some- 
thing that Leibniz can say on his own corporeal substance view, but 
something that may be more problematic on Descartes’s brand of 
dualism (108; cf. 115). In his heart, Sleigh implies, Leibniz actually 
preferred the monadological view. 

Sleigh’s account of Leibniz’s thought is certainly very illuminat- 
ing, and based on very careful reading of the sources. But I am not 
altogether convinced. First of all, I see problems with Sleigh’s attri- 
bution of the spiritual view to Leibniz in these texts. Leibniz cer- 
tainly toyed with the view that the basic constituents of his world are 
like souls; but I see very little reason to believe that, in the period in 
question, he held that the only such souls were spirits in the sense 
that Sleigh uses the term, conscious souls, capable of reason, as 
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opposed to what he was later to call monads, which come in all 
varieties, conscious and unconscious. In support of the spiritual 
view, Sleigh points to two sections of the Discourse, 14 and 34, and 
to a deleted passage from section 8 (98, 101). But section 14 begins 
with passage in which Leibniz quite explicitly argues that there is a 
substance that corresponds to every point of view God has on the 
world, something unlikely if the only substances are spirits. And 
section 34 begins with an explicit distinction between intelligent 
souls and souls that “do not know what they are nor what they 
do.”!? The best evidence Sleigh presents is a deletion Leibniz made 
from an early version of section 8 of the Discourse. In that version, 
Leibniz used the ring of Gyges as an example of an individual sub- 
stance. In that version, Leibniz originally wrote: “I speak here as if it 
were assured that this ring had a consciousness” (quoted 101). This, 
indeed, shows that Leibniz may once have regarded being conscious 
as necessary for substancehood. But, on the other hand, he did cross 
this passage out, and replace the phrase ‘had a consciousness’ with 
‘is a substance’, as Sleigh notes. He might (quickly) have realized 
that, while consciousness might be a sufficient condition for consti- 
tuting the ring a genuine substance, it is not necessary; a lesser soul 
or form would do just as well. Or it might have been an absent- 
minded slip of the pen. In either case, it seems slim evidence to 
ground the claim that Leibniz took seriously Sleigh’s spiritual view 
at this stage of his thought. 

Secondly, I am not convinced that the Leibnizian critiques of the 
reality of extension Sleigh cites are that damaging to a corporeal- 
substance point of view. To be sure, if Leibniz believes in the reality 
of corporeal substance (not to mention the claim that they are meta- 
physically basic, as he suggests in the correspondence with Arnauld), 
then he is admitting something into his world that is extended in a 
way that minds, souls, or monads are not. But that is not to say that 
corporeal substances are extended in the deepest sense. Leibniz can 
certainly hold (and probably did hold, on my view) that what is basic 
in body is force, and that extension is ultimately to be understood in 
terms of the forces bodies have. In particular, Leibniz seems to hold 
that extension results from the primitive passive force that excludes 


12 Sleigh, in fact, cites (98) this distinction in his account of the spiritual theory. 
Be also ner er Utara ad eee ean a tly de- 
leted, in which Leiboiz says that he not undertake to or not 
there are “substances, souls, or substantial forms that are not intelligent.” While 
this is perfectly consistent with the spiritual view, it 1s equally consistent with its 
contrary. 
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the penetration of one body by another. On this view, an extended 
body can be regarded as the diffusion of the force of impenetrability 
that defines a region that is the space occupied by a given body. 
Extension is, at least to that extent, less basic, metaphysically speak- 
ing, and thus less real than force. Although on Sleigh’s reading the 
arguments criticizing the notion of extension lead to a monadologi- 
cal metaphysics, in which extended things are entirely phenomenal, 
on my reading these same arguments are really part of his support 
for this somewhat different position, a world of corporeal sub- 
stances grounded in active and passive force. 

Consider, for example, the argument which Sleigh especially em- 
phasizes, in which Leibniz argues that any body (corporeal sub- 
stance) is made up of an actual infinity of smaller parts, from which 
he concludes not merely that they have shapes of infinite complex- 
ity, but also that, in the strictest sense, they have no shape at all 
(112-5). Both Sleigh and I could agree that Leibniz’s intention here 
is to undermine the Cartesian claim that the essence of body is 
extension, that extension is something that characterizes body in its 
innermost nature. Anything really extended, essentially or other- 
wise, must have some shape or another; so, by showing that despite 
appearances, bodies do not really have shapes, Leibniz shows that 
they are not really extended either. This may lead one to a world of 
nonextended and mind-like monads. But it might also be used as an 
argument against the existence of Cartesian substances whose es- 
sence is extension, and toward the sort of view I sketched above, on 
which what is basic is force. While arguments against the reality of 
extension are intended to lead one beyond the kind of superficial 
account of body one finds in the Cartesian view, they do not really 
tell us where we hit bottom, metaphysically speaking, whether it is 
the corporeal substances with their active and passive forces, or the 
monads with their perceptions and appetitions that are supposed to 
ground the world of phenomena. 

Sleigh acknowledges this in a way, and admits that an advocate of 
corporeal substance in Leibniz, like myself, can read this argument 
in this manner. In the end, his reading of the argument rests on the 
claim that “lurking in the background” is either a monadological 
analysis of body, or, perhaps weaker, the idea that there must be 
some analysis of that sort to be found and developed (115). But this 
is to admit that the issue is not the argument itself, but what it is that 
is “lurking in the background.” And whatever may be lurking in the 
background, it seems clear that, when writing to Arnauld, what is 
present in the foreground is a metaphysical picture grounded in the 


BOOK REVIEWS 165 


notion of corporeal substance. I, for one, see no reason to prefer 
the view hidden in the shadows to that which Leibniz presents explic- 
itly. In the end, I much prefer to think that Leibniz stressed the 
corporeal-substance view when writing to Arnauld not for any re- 
ligious or political reasons, but because it is, quite simply, what he 
believed, not always before or always after, but at that time. One can 
certainly find monadological currents in Leibniz’s thought in these 
years. But until someone can convince me that these undercurrents 
and suggestions are more representative of Leibniz’s true thoughts 
than what Sleigh admits Leibniz wrote to Arnauld, I shall continue 
to read Leibniz as writing what he meant, not only bere, but in the 
many other texts of the period in which the corporeal-substance 
view is emphasized.” 

Although Sleigh and I may disagree on this point, this in no way 
decreases my admiration for his accomplishment; despite our dis- 
agreements, his book remains, in my estimation, a brilliant piece of 
historical scholarship and philosophical analysis. His book and Wil- 
son’s are marks of how far scholarship in the history of philosophy 
has come over the last few years, models of the best in contemporary 
scholarship. One can only hope that they will have the influence on 
the field that they genuinely deserve, and inspire others to approach 
the history of philosophy with the kind of care and intelligence that 
these books demonstrate. 

DANIEL GARBER 
University of Chicago 


13 See Garber, “Leibniz and the Foundations of Physics: The Middle Years,” 
pp. 66-72, for an account of such texts. 
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WHY KNOWLEDGE IS MERELY TRUE BELIEF* 


claim that propositional knowledge is merely true belief, that 

an agent knows that p if and only if p is true and the agent 

believes that p. Elsewhere, I have attempted to answer the obvi- 
ous objections to that position.’ Here, I want to offer a positive 
argument for it. 

I. TERMINOLOGICAL GROUNDWORK 

One objection that has been raised to the claim that knowledge is 
merely true belief is that it seems to leave us without any preanalytic 
sense of what knowledge is. That is, it seems to be a question 
whether the proffered analysis could possibly be an analysis of 
knowledge at all.? This objection seems to arise because there is a 
very widely held intuition that belief and truth could not possibly be 
sufficient for knowledge. Then the question of whether the analysis 
embodies a theory of knowledge seems to come down to a clash of 
intuitions about what notion it is which is being analyzed. For this 
reason, I want to offer a way of picking out the notion to be defined. 
Here is my proposal: knowledge is our epistemic goal in the genera- 
tion of particular proposttional beliefs. It will be convenient to 
have a term to refer to procedures that have as their goal the genera- 


*T would like to thank Jeffrey Thumak, who commented elaborately on an 
ia bre aaae ai a 

1 “Knowledge is Merely True Belief,” American Philosophical Quarterly, 
XV, 2 (April 1991): 157-65. That essay merely lays the dwork for a 
defense of the position; roughly, I try to show there that the that 
is merely true belief cannot be refuted by the flick of a counterexampk. I refer 
ee eee a 
merely to reply to the claim that our ordi use of the term ‘knowledge’ com- 
mitted us to a distinction between knowl and mere true belief. This appeared 
to me to be the only explicit argument in literature to the effect that at least 
some third condition was required, and thus I hoped, by attacking that argument, 
to suspend the question of burden of proof. 

? This objection was brought to my attention by Tim McGrew. 
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tion of belief with regard to particular propositions. Let us refer to 
such procedures collectively as inquiry. Knowledge, then, is the 
- purpose or telos of inquiry. (This minimal conception of knowledge 
is, I believe, in keeping with the traditional use of the concept of 
knowledge in, for instance, Plato, Descartes, and Hume.) 

I say that inquiry comprises procedures that have as their goal the 
generation of belief in particular propositions, because there are 
wider-ranging epistemic goals than knowledge: for example, ration- 
ality and wisdom. But such concepts take beliefs in groups or even 
consider the entire structure of beliefs of an epistemic agent. It can 
be rational or wise, as part of such an overall structure, to believe 
that p. But one cannot have the rationality or wisdom that p, as one 
can have the knowledge that p. At any rate, we might think of a 
theory of epistemic justification as doing two things: (a) giving gen- 
eral procedures for inquiry, and (b) setting out standards for evaluat- 
ing the productions of inquiry, that is, particular propositional be- 
liefs. A belief, then, is justified if it is produced (or, alternately, 
could have been produced) by a correct procedure for inquiry,” and 
to say that a belief is justified is to evaluate it positively along episte- 
mic lines. I regard both projects as worthwhile. I am by no means 
arguing that an account of justification is unnecessary or unimpor- 
tant; my claim, rather, is that such an account is not part of the 
theory of knowledge. 

To continue with terminological groundwork, it is possible to dis- 
tinguish two sorts of epistemological projects: normative and de- 
scriptive. Descriptive epistemology is essentially a branch of psychol- 
ogy or cognitive science. It asks: How do we come to believe or to 
know? What cognitive mechanisms and causal circumstances oper- 
ate in the acquisition of belief or knowledge? What, for example, is 
the function with regard to belief and other propositional attitudes 
of perception, memory, and so forth? Normative epistemology, on 
the other hand, is what has usually been thought of as a distinctively 
philosophical discipline. It asks: How should we generate beliefs 
with regard to achieving knowledge? How should inquiry be con- 
ducted and evaluated? And what, exactly, is its goal, i.e., what is 
knowledge? 

Now, these two sorts of investigations are by no means uncon- 
nected.’ For example, it would be ludicrous to recommend (norma- 


3 On internalist views, that agent must be aware that the belief was so produced; 
on externalist views, the agent need not be aware of this fact. 

* In fact, I agree with Alvin Goldman that they are very intimately connected 
indeed. See Epistemology and Cognition (Cambridge: Harvard, 19 6). 
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tively) a procedure for generating beliefs which was literally impossi- 
ble (descriptively) to apply. Thus, the normative project might be 
thought of as giving guidelines for the correct or optimal applica- 
tion of the procedures we do use to generate beliefs. I shall be 
concerned here with normative epistemology. 
I DEONTOLOGICAL AND TELEOLOGICAL CONCEPTIONS 

The most elaborately developed normative theories are in ethics, 
and thus normative epistemology often relies on a parallel to ethics. 
Ethical systems have been divided into two kinds: deontological and 
teleological.” Proponents of the former think of-moral action as 
what is done in obedience to principles that serve in turn no end 
that could be looked on as an overall moral goal. Obligation and 
permission are central concepts here. If I do only what is permissi- 
ble (possibly, if I do it because it is permissible), or what is de- 
manded by duty (possibly, if I do it because duty demands it), then I 
am not subject to ethical disapprobation even if the result of my 
action is disastrous. According to proponents of teleological ethics, 
on the other hand, an action is morally good when it conduces to 
some goal, for example, the greatest happiness of the greatest num- 
ber, or if it is in accordance with some rule the observance of which 
so conduces. Similarly, there might be two sorts of normative episte- 
mology: one that prescribes duties and permissions in generating 
beliefs (and other propositional attitudes) without regard to any 
overarching epistemic goal, and one that prescribes some goal for 
epistemic activity and recognizes the legitimacy of any procedure 
that conduces to that goal, or, alternately, of any procedure that 
accords with certain rules, the observance of which in turn conduces 
to that goal. I have asserted that knowledge is the goal of inquiry. 
But this supposes that inquiry has some goal, which would be denied 
by a proponent of deontological normative epistemology. So we had 
better start with a discussion of whether that position is plausible. 

The taxonomy of normative epistemology suggested by this partic- 
ular parallel to ethics has been developed by William Alston.’ Be- 
cause my discussion follows his to some extent, I should pause here 
to differentiate my use of terms from his. Alston uses the term 
‘deontological’ to distinguish systems which epistemically prescribe, 


* My characterizations of these ethical positions are moat! ae inadequate to 
many actual positions. The present point, however, is simply parallel to episte- 
mology, so we need not be concerned with capturing the subtleties of full-fledged 

* “Concepts of Epistemic Justification,” in Eptstemic Justification (Ithaca: 
Cornell, 1989), pp. 81-114. 
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proscribe, or permit certain beliefs or belief-generating procedures, 
from systems he terms ‘evaluative’ which merely assess certain be- 
liefs and procedures from the standpoint of some standard. He 
points out that it is not the case that all standards of evaluation 
depend on such concepts as obligation and permission, that not all 
standards carry with them the implication that the subject is praise- 
worthy for meeting them or blameworthy for violating them. For 
example, to say that some person is beautiful is to evaluate her 
appearance positively, but it is not to say that she is praiseworthy for 
her appearance, since she may not be responsible for it; it may be a 
genetic endowment (ibid., p. 96). The relevant point here is that 
both sorts of systems (Alston’s ‘deontological’ and ‘evaluative’) are 
what I term ‘teleological’; he describes both as being directed to the 
goal of generating true belief and avoiding false belief. 

It may be a question, then, whether any philosopher has seriously 
held a deontological position in my sense, has seriously held that we 
have some epistemic obligations, but that there is no overarching 
goal of inquiry. Some extreme idealists and positivists, who identify 
truth and justification, may harbor such a view. If one has a coher- 
ence theory of truth and also a coherence theory of justification, for 
example, then one may simply count as knowledge whatever beliefs 
are generated by whatever procedures turn out to embody justifica- 
tion; if it was supposed to be-a sheer fact that we ought to follow 
such procedures, if there were no further goal in mind, this would 
be a deontological position in my sense. The notion of knowledge is 
in some sense superfluous on this position; at least, it does not 
describe a distinctive purpose for inquiry above the fulfillment of 
certain duties or obedience to certain rules. Clearer examples of 
deontological views could be proposed: for example, believe all 
and only the propositions contained in the Bible, or in the writings 
of Mao. 

Deontological views in my sense have, these days, few proponents, 
and seem on the face of it extreme and implausible. Their implausi- 
bility can be brought out in the following way. What is the source of 
our epistemic obligations? Or, to put it another way, is there any 
good reason to think that we have any distinctively epistemic obliga- 
tions at all, in the absence of some overarching purpose for inquiry? 
The same problem arises for deontological moral theories, but here 
there are plausible, or at least fairly widely proffered, answers: our 
moral obligations derive from God, for example, or from the state. 
Again, it is possible that the very same sources yield our epistemic 
obligations. But to establish this, we would have to give good rea- 
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sons to think that God does impose epistemic obligations, or to give 
an account of the “epistemic legitimacy” of the state. Furthermore, 
there no longer appears to be any distinction between moral and 
epistemic constraints on the generation of beliefs. There no longer 
appears to be any distinctively epistemological enterprise. 

I shall take it, until such objections can be answered, that a purely 
deontological conception of epistemic norms is highly implausible. 
Indeed, no contemporary philosopher with whose work I am 
acquainted holds such a view. It might be asked, then, why I have 
bothered to discuss it at all. The point is just this: inquiry does have a 
purpose; only teleological conceptions of normative epistemology 
are plausible. But what about deontological conceptions in Alston’s 
sense? Such conceptions formulate obligations and permissions, and 
award praise and blame, with regard to the way these obligations 
and permissions conduce to an overarching epistemic goal. I shall 
call such views, in keeping with my own rather than Alston’s use of 
terms, injunctive teleological accounts of normative epistemology. 
They might be thought of by analogy to rule utilitarianism, where 
one’s obligations are to follow rules, the observance of which tends 
to the greatest happiness of the greatest number. Rule utilitarianism 
has the advantage over act utilitarianism that it can yield relatively 
detailed injunctions on conduct, rather than the huge amorphous 
injunction to act so as to create the greatest good of the greatest 
number, which might be extremely difficult to apply in individual 
cases. Likewise, injunctive teleological views yield duties or permis- 
sions which, if observed, will conduce to knowledge, rather than the 
huge amorphous duty to generate beliefs so as to gain knowledge. 

Alston attacks injunctive teleological accounts as presupposing 
what has been called ‘‘doxastic voluntarism,” the view that our be- 
liefs are, in one sense or another, under our voluntary control. After 
all, I cannot have an obligation to do something or to refrain from 
doing it if it is not up to me whether I do it or not. Injunctive 
teleological conceptions of justification indeed suppose doxastic 
voluntarism. And Alston’s objections to doxastic voluntarism appear 
to me devastating. For a detailed discussion of the issue, I refer the 
reader to Alston.’ Here, I recount a single example: when I seem to 
see a tree before me, and have no reason to think that, for example, 
I am dreaming, or that the perceptual situation is grossly abnormal, 
or that someone is replacing trees in my neighborhood with plastic 


7 See esp. “The Deontological Conception of Epistemic Justification,” ın Epi- 
stemic Justification, sect. II-VI 
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replicas, it is simply not up to me whether I believe there is a tree 
before me; like it or not, I shall believe it. So I should hardly be 
praised or blamed for doing so. It should be noted that this criticism 
applies not only to injunctive teleological accounts but also to deon- 
tological accounts in my sense. 

If this criticism is plausible, we are left with a conception of justi- 
fication that Alston calls an “evaluative” or “strong position” ac- 
count. These terms describe justification from the two angles men- 
tioned above; to say that it is desirable to get into a strong position is 
to give a view about (a) the correct procedures for inquiry, while to 
evaluate someone’s epistemic position positively (though not in 
deontological terms) is to apply (b) some account of evaluation with 
regard to these procedures. We ought not, on Alston’s view, to alot 
praise and blame in epistemic matters, but some epistemic situations 
are better than others. (It is better to be beautiful than ugly, but, 
again, one is not usually praiseworthy or blameworthy in this re- 
gard.) On Alston’s view, the most plausible way to formulate this 
conception is that one’s belief is justified if it is based on adequate 
grounds, and having adequate grounds is a matter of being in a 
strong position to generate a true belief on the matter. It is a good 
thing to be in such a position, but one is not, or at least one is not 
always, responsible for getting into such a position. Now, it is not 
part of my current project to defend this conception of justification 
(though I regard it as plausible); rather, I want to use it in a certain 
way to elucidate what, on it, would be a plausible view of our episte- 
mic goal with regard to particular propositions, that is, what would 
be a plausible theory of propositional knowledge. 

OL TRUTH CONDUCIVENESS 

It is widely held that our epistemic goal is achieving true beliefs and 
avoiding false ones about propositions with which we are epistemi- 
cally concerned. (We have seen that Alston, for one, endorses that 
view.) That is, it is widely admitted that, on any good account of 
justification, there must be reason to think that the beliefs justified 
on the account are likely to be true. Indeed, proponents of all the 
major conceptions of justification hold this position. For example, 
the foundationalist Paul Moser? writes: 

[E]pistemic justification is essentially related to the so-called cognitive 

goal of truth, insofar as an individual belief is epistemically justified 

only if it is appropriately directed toward the goal of truth. More specif- 


* Empirical Justification (Dordrecht: Reidel, 1985). 
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ically, on the present conception, one is epistemically justified in believ- 
ing a proposition only if one has good reason to believe it is true (ihtd., 
P. 4). 


The reliabilist Alvin Goldman claims, similarly, that a condition on 
an account of justification is that beliefs justified on the account be 
likely to be true; he says that a plausible conception of justification 
will be “truth-linked” (op. cit., pp. 116-21). And the coherentist 
Laurence BonJour? puts it even more strongly: 


If epistemic justification were not conducive to truth in this way, if 
finding epistemically justified beliefs did not substantially increase the 
likelihood of finding true ones, epistemic justification would be irrele- 
vant to our main cognitive goal and of dubious worth. It is only if we 
have some reason to think that epistemic justification constitutes a path 
to truth that we as cognitive human beings have any motive for prefer- 
ring epistemically justified beliefs to epistemically unjustified ones. 
Epistemic justification is therefore in the final analysis only an instru- 
mental value, not an intrinsic one (tbid., p. 8). 


In fact, it is often enough taken to be the distinguishing mark of 
the fact that we are epistemically concerned with a proposition that 
we are concerned with its truth or falsity. That is what, on the view 
of many philosophers, distinguishes epistemic from moral or pru- 
dential constraints on belief, what distinguishes inquiry from other 
belief-generating procedures.’° I have argued that a plausible nor- 
mative epistemology will be teleological. And I have claimed that the 
conception that accounts of knowledge are attempting to analyze or 
describe is that of the epistemic telos with regard to particular prop- 
ositions. It would follow that, if a philosopher holds that the episte- 
mic telos is merely true belief, that philosopher implicitly commits 
himself, his own asservations to the contrary, to the view that knowl- 
edge is merely true belief. 

I think that this is the case. That is, I think that, in the above 
passages, these philosophers have committed themselves implicitly 
to the view that knowledge is merely true belief, and that justifica- 
tion is a criterion rather than a logically necessary condition of 


° The Structure of Empirical Knowledge (Cambridge: Harvard, 1985). 

10 1 hope to argue elsewhere that there are no non-epistemic belief-generating 
procedures in the relevant sense, that believing p and taking to be true are the 
same thing. If this case can be made, then I think it can be to follow that 
knowledge is the telos of inquiry. So, though I take the latter clam here as an 
unargued premise, I think it can be decisively established. 
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knowledge. By a criterion, J mean a test for whether some item has 
some property that is not itself a logically necessary condition of that 
item having that property. Justification on the present view is, first 
of all, a means by which we achieve knowledge, that is, by which we 
arrive at true beliefs, and, second, it provides a test of whether 
someone has knowledge, that is, whether her beliefs are true. So, 
again, the present view does not make accounts of justification triv- 
ial, or unconnected with the assessment of claims to know. If our 
epistemic goal with regard to particular propositions is true belief, 
then justification (a) gives procedures by which true beliefs are ob- 
tained, and (b) gives standards for evaluating the products of such 
procedures with regard to that goal. From the point of view of (a), 
justification prescribes techniques by which knowledge is gained. 
From the point of view of (b), it gives a criterion for knowledge. But 
in neither case does it describe a logically necessary condition for 
knowledge. 

Another way of putting the matter is this. If we describe justifica- 
tion as of merely instrumental value with regard to arriving at truth, 
as BonJour does explicitly, we can no longer maintain both that 
knowledge is the telos of inquiry and that justification is a necessary 
condition of knowledge. It is incoherent to build a specification of 
something regarded merely as a means of achieving some goal into 
the description of the goal itself; in such circumstances, the goal can 
be described independently of the means. So, if justification is de- 
manded because it is instrumental to true belief, it cannot also be 
maintained that knowledge is justified true belief. 

I shall now certainly be accused of begging the question by assum- 
ing that knowledge is the goal of inquiry. There is justice in this 
claim in that I have not gone very far toward establishing the 
point.!! But I would ask my accusers at this point whether they can 
do better in describing the conception that theories of knowledge 
set out to analyze or describe without begging the question in favor 
of some such theory. And I ask also, if knowledge is not the over- 
arching epistemic telos with regard to particular propositions, why 
such tremendous emphasis has been placed on the theory of knowl- 
edge in the history of philosophy, and just what function that notion 
serves within that history. If knowledge is not the overarching pur- 
pose of inquiry, then why is the notion important, and why should 


" To repeat, however, I hope to establish this claim on completely independent 
grounds. 
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we continue to be concerned in normative epistemology above all 
with what knowledge is and how it can be achieved? If we want to 
withhold the term ‘knowledge’ from mere true belief, but also want 
to hold that mere true belief is the purpose of inquiry, then, first of 
all, I think that ‘knowledge’ will now merely be a technical term with 
a stipulated definition. And, second, I do not think it will be central 
to epistemology, since it no longer represents our epistemic goal. 
And, third, I think the stipulated definition will either be redundant 
(if justification is held to be truth conducive) or, as I shall argue, 
incoherent (if it is not). 

Now, it may well be held that justification is of more than instru- 
mental value, because if we are not justified in believing p, though p 
is true and we in fact believe it, we may have false beliefs that lead us 
to p, and we may continue to generate false beliefs in the future. All 
of this is true, but it is irrelevant to the present point. Recall that I 
have characterized knowledge as our epistemic goal with regard to 
particular propositions. Insofar as p is concerned, this goal has 
been realized if p is true and we believe it. Insofar as we also have 
such goals as continuing to generate true beliefs, rendering our 
system of beliefs coherent, and so forth, it is desirable to have justi- 
fied beliefs. But with regard to any particular proposition, our goal 
has been reached if we believe that proposition and it is true. 

But I do not want simply to let the matter rest on a supposed 
agreement among some contemporary epistemologists that our 
epistemic goal with regard to particular propositions is true belief. 
Such epistemologists are agreed that knowledge is at least justified 
true belief. I think that Alston is right to think that the only plausible 
way to construe this claim is that knowledge is at least true belief 
based on adequate grounds, or true belief reached from a strong 
position. So perhaps the figures in question, on reflection, would 
describe the epistemic telos not as true belief but as true belief based 
on adequate grounds, or true belief reached from a strong position. 

It must now be asked, however, why do we want to have adequate 
grounds? Why do we want to be in a strong position? This question 
ought to be misguided if true belief based on adequate grounds or 
true belief reached from a strong position is in fact the purpose of 
inquiry. For there is no good answer to the question of why we 
desire our ultimate ends. But the question is hardly misguided. In 
fact, we cannot even specify what it is to have adequate grounds 
except that these grounds tend to establish that the proposition in 
question is true; we cannot even specify what it is to be in a strong 
position except as being in a strong position to get the truth. This 
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indicates that the purpose of inquiry can be formulated without 
reference to the notions of ground or position. Thus, on the views in 
question, believing the truth is in fact our overarching epistemic 
telos with regard to particular propositions, on the only plausible 
conception of justification. Hence, on these views, knowledge is 


merely true belief. 
IV LYCAN’S EXPLANATIONISM 


The only fully developed, contemporary account of justification of 
which I am aware that appears to steer clear of this consequence is 
that expressed by William Lycan in Judgement and Justification. !* 
He proceeds by denying that truth is the only or the overall purpose 
of inquiry. His claim is that beliefs are justified in virtue of their 
explanatory role; they are justified by their coherence with previ- 
ously held beliefs, by their simplicity, their scope in an explanatory 
framework, and so forth. And Lycan even argues that all inference 
relies on a (perhaps implicit) appeal to best explanation. For exam- 
ple, even in a simple modus ponens deduction, we are not forced to 
accept the conclusion; we always have the option of rejecting one of 
the premises. So Lycan concludes that all reasoning involves compar- 
isons of plausibility. He describes justified beliefs as those which 
increase the overall explanatory coherence of one’s system of 
beliefs. 

Lycan’s position is in some respects similar to BonJour’s; he holds 
that we have “spontaneous,” that is, noninferred beliefs that have 
some degree of prima facie justification (due to the explanatory 
virtue of conservatism, which states that we should not without some 
explanatory motivation change the beliefs we already do have). But 
Lycan’s position differs in two crucial respects from BonJour’s. 
First, Lycan goes further toward elucidating the notion of coher- 
ence in his description of the explanatory virtues. Second, whereas 
BonJour holds that any coherence theorist must give a “‘metajustifi- 
cation” of his view, that is, he must give reasons to think that beliefs 
that are justified on the view are likely to be true, Lycan demands no 
such thing. In fact, Lycan goes so far as to say: 

[I]f ... appeal to simplicity [e.g.] is a fundamental epistemic method, 

then it is fundamental; no further question can arise regarding its 

“connection to truth.” ... For that matter, I do not in the first place 

accept the a priori assumption that epistemic justification must be a 

matter of quantitative relation to truth (ibid., p. 148). 

Lycan does indeed give a “‘metajustification” of his account, how- 
ever. Simplicity, power, and the rest are useful on the evolutionary 


12 New York: Cambridge, 1988. 
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scale. They are adaptive. I am not sure how plausible this is, but the 
point is that, if we are to hold that justification is a component of 
knowledge, we need to take Lycan’s general approach. That is, we 
must describe justification as conducing to some goal other than 
truth, or else lay down justification as a deontological desideratum. 
We cannot, as we have seen, coherently maintain that knowledge is 
the telos of inquiry, that justification is logically required for knowl- 
edge, and that accounts of justification must in turn be metajustified 
by their conduciveness to truth. And Lycan manages to avoid also 
the dangers of a deontological epistemology; he does not in fact 
regard simplicity or power as sheer desiderata, but as useful adapta- 
tions. Nor does his view assume doxastic voluntarism. 

Nevertheless, the view has severe drawbacks. To begin with, it 
makes the distinction between epistemic and other values obscure. 
In fact, it absorbs epistemic value into sheer utility, and it is not at all 
clear that this is a plausible line. It is true, as Lycan admits, that 
there is no a priori reason to think that a simple hypothesis is more 
likely to be true than an extremely complex one. But a simple hy- 
pothesis has less cognitive cost; it is efficient as an explanation. That 
may make it seem that it is better in the long run to believe simple 
hypotheses. But it is not clear that it is in fact, better, epistemically 
speaking, to believe simple hypotheses. 

To get clear on the point, let us suppose (as seems likely, though 
Lycan is reticent here) that Lycan holds justification in his sense to 
be a necessary condition of knowledge, and that he holds truth also 
to be a necessary condition.’ Then knowledge turns out to be (at 
least) true belief that is generated by adaptive explanatory tech- 
niques. But this seems odd; now that we recognize two primitive 
epistemic values, they may well conflict. For example, is it good to 
believe, in some circumstances, highly explanatory falsehoods? The 
account surely leaves some such cases strictly undecidable, since it 
describes both elegance and truth as intrinsic values. But is this 
plausible? Surely, we might want to say, though it can be useful to 
believe all sorts of falsehoods, it is always epistemically good to be- 
lieve the truth. It may be useful, for example, for me to have a 
cognitive technique that causes me to believe that I have all sorts of 
positive qualities to an extremely high degree; I may be happier and 
more efficient if I believe myself to be extraordinarily intelligent, 
good-looking, humble, and so forth. But it is not a good thing epi- 


13 It is only on this assumption that Lycan’s position constitutes an alternative 
to the views described in the previous section. So I shall talk as though this is 
Lycan’s position, though, again, he does not explicitly endorse it. 


178 THE JOURNAL OF PHILOSOPHY 


stemically to believe such things if they are false. (Fortunately, how- 
ever, in my case they are all true.) In short, we seem to have a notion 
of epistemic value that is radically divorced from utility. 

This raises the fundamental difficulty with Lycan’s strategy; it 
makes prudential justification a component of knowledge. But if, as 
I say, we have a notion of epistemic justification in distinction from 
prudential justification, and if knowledge is essentially an epistemic 
matter, then Lycan’s account leaves us confused. Either we ought to 
believe what is produced by procedures that are adaptive or else we 
ought to believe what is produced by procedures that are likely to 
yield true beliefs. But we cannot coherently demand that we follow 
both of these as ultimate aims, because they may and in fact will 
conflict. Then we are left with an internally incoherent concept of 
knowledge. Of course, we can subsume one to the other; we can 
assert that the truth is adaptive and that is why it should be believed, 
or that we should believe what works on explanatory grounds be- 
cause it is likely to be true. But then, first of all, we need an account 
of why this should be so. And, second, we should now characterize 
knowledge univocally as belief in what is true, or belief in what is 
adaptive. If we take the second road, then I assert that we are going 
to hold all sorts of falsehoods to be known, and that is prima facie 
implausible. 

In fact, Lycan keeps coming back to the suggestion that explana- 
tory virtues are truth-conducive, although he often enough admits 
that there is no reason why this should be the case. This is because, I 
think, he shares the deep-seated intuition that knowledge and truth 
are bound up, that knowledge is a distinctively epistemic notion. He 
writes: 


Nor, please note, do I understand “rational” here as “likely to yield 
truth,” for this would be ... to introduce “true” at the beginning of 
our epistemological quest rather than see it fall out at the end. ... 
[I]ntroducing “true” at this point would both threaten circularity or 
regress again and invite questions of the form “But what reasons have 
you to think that U leads to truth?” where U is a posited ultimate 
principle of theory choice... In the beginning, “rational” ıs a primi- 
tive term used to evaluate epistemic acts; particular principles are later 
seen to “tend toward truth,” because the beliefs they produce are 
rational—not the other way around (op. cit., pp. 155-6). 


This passage strikes me as wanting to have it both ways. It seems that 
we simply ought to believe what is rational (simple, powerful, etc.), 
and then, as a sort of added bonus, as a matter of fact what we 
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believe will happen to be true. But this is odd if, as Lycan admits, 
there is no reason to think that a simple explanation is more likely to 
be true than a very complex one. Now, as a matter of fact, the 
notion of truth itself here may well be disintegrating, Lycan’s prag- 
matist tendencies might be extending toward taking truth to be 
warranted assertibility. The view would then face familiar and, I 
think, devastating problems, but I shall not address those here." 

But there is a further argument that Lycan gives for his position. 
He argues that we cannot be enjoined from doing what we cannot 
but do, and that we cannot fail to use explanatory criteria in gener- 
ating beliefs: 


I suspect that it is biologically necessary to a sentient organism’s sur- 
vival that the organism organize its experience m the simplest, most 
coherent and expedient, and yet most powerful and comprehensive 
way that it can. ... It thus comes to be lawlike that a viable organism 
operates, epistemically, according to the elegance principles ... or 
something like them. ... If the suspicion I have expressed is well- 
founded, the point is this: A normal human being cannot keep from 
forming beliefs in ways that have survival value, or from making use of 
the elegance principles in doing so—at least up to a point. And if that 
is so, then no one is obligated in any way to refrain from following 
those primitive procedures (op. cit., pp. 126-7). 


I would like to make several remarks about this passage. First of all, 
there is no reason whatever to believe that it is true that we are 
under adaptive necessity of adopting the most elegant explanation 
we can come up witb, which is exactly what Lycan implies. In fact, 
that is a bizarre assertion; it claims quite generally that people never, 
or at any rate very rarely, adopt a less elegant hypothesis when a 
more elegant one is available, which is obviously false. Aside from 
the questions about the nature of simplicity (which Lycan leaves 
more or less completely obscure, and which I suspect cannot be 
clarified), I can surely resolve to adopt whatever hypothesis seems 
most likely to be true, however inelegant it may seem to be. It might 
be very elegant indeed, for example, to believe that the extinction of 
the dinosaurs resulted from a nuclear test on this planet by aliens, 
but a nice messy hypothesis involving climactic changes and disease 
might seem more likely to be true. 


4 All that would be necessary to refute this view is the descripton of a single 
case in which some agent is warranted in asserting p and p is false. Such cases are 
not hard to devise. 
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In fact, Lycan here goes further in his doxastic determinism than 
does even a rabid determinist, such as Alston. For Alston admits that 
we can do our best to apply conscientious techniques of belief for- 
mation. He simply denies that the fact that this is to some extent 
under our control is enough to render the beliefs that such tech- 
niques generate voluntary and hence subject to praise and blame. 
But Lycan asserts’ that we simply must use the techniques of belief 
formation that we do use. But this is false. Advances in methodology 
have been made, and can be voluntarily adopted. Nor do they all 
yield increases in elegance. 

V CONCLUSION 
Finally, I would like to formulate the argument that knowledge is 
merely true belief as a dilemma. If accounts of justification them- 
selves must be metajustified by an argument to the effect that beliefs 
justified on the account are likely to be true, then justification is of 
instrumental rather than intrinsic value. That is, we want justifica- 
tion with regard to particular propositions not because of its intrin- 
sic worth, but because it is a means to what we regard as having 
intrinsic worth, namely, true belief. If knowledge is the epistemic 
telos with regard to particular propositions, then it follows from 
such a position that knowledge is merely true belief. On the other 
hand, if justification is valued not for its truth conduciveness, but 
for its conduciveness to some other goal, for example, successful 
adaptation, or for that matter, if justification is itself proposed as an 
intrinsic goal (a demand of reason, for example), then knowledge is 
an incoherent notion. It gives us two goals for inquiry, which cannot 
always be realized simultaneously. As far as J can see, these two 
possibilities exhaust the possible accounts of knowledge as (at least) 
justified true belief. Again, either justification is instrumental to 
truth or it is not. If it is, then knowledge is merely true belief. If it is 
not, there is no longer a coherent concept of knowledge. Thus 
knowledge is merely true belief. Q.E.D. 
CRISPIN SARTWELL 

Vanderbilt University 
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A SENTENTIAL THEORY OF PROPOSITIONAL ATTITUDES* 


would like first to provide a general framework for the seman- 
I tics of propositional attitude sentences. I shall then try to show 

how a sentential theory can very well be integrated within such 
a framework. I submit that the framework should take the form of a 
semantics that purports to give the assertability conditions of sen- 
tences. This entails that we should replace truth conditional seman- 
tics by a semantical theory that stipulates the conditions under 
which a sentence can be used and explains in this way “what role 
these sentences can play in our lives,” to use Saul Kripke’s’ happy 
phrase. This general framework, one will have guessed, is the one 
that Kripke ascribes to the Ludwig Wittgenstein of the Philosophi- 
cal Investigations.? l 

I would like first to make some general remarks on the semantics 
of assertability conditions. Such a semantics must take at least two 
factors into account: illocutionary force, which concerns the particu- 
lar force with which the sentence can be uttered; and locutionary 
meaning, which concerns the conditions under which the sentences 
can be held true. These locutionary conditions can be identified 
with objective conditions that must prevail if the sentence is true 
or with verification procedures that serve as justification for the 
assertion. 

I shall not deal with the problems related to the illocutionary 
conditions of use in this paper. Let me just say that propositional 
attitude sentences can give rise to the performance of a very wide 
variety of illocutionary acts. We can utter them intending to per- 
form directly assertives, directives, commissives, expressives, and de- 
claratives as long as certain propositional conditions prevail. First- 
person psychological sentences are particularly revealing in this re- 
spect. The five fundamental categories of illocutionary forces can be 
directly performed over them even if they are in the indicative 
mood. I mention this point just to indicate how illocutionary force 
comes into play within such a general framework. I allow illocution- 
ary force to become an important ingredient in semantics without 


* This paper was made possible partly by a Canada Research Fellowship coming 
from the Social Sciences and Humanıties Research Council of Canada. I also wish 
to thank Tyler Burge, Charles Daniels, Kit Fine, and David Kaplan for their com- 
ments and criticisms. 

Wittgenstein on Rules and Private Language (Cambridge: Harvard, 1982). 

? G. E. M. Anscombe, trans. (New York: Blackrell, 1953). 


0022-862X/92/8904/181-201 © 1992 The Journal of Philosophy, Inc. 


182 THE JOURNAL OF PHILOSOPHY 


suggesting that the indicative mood is a syntactic mark of illocution- 
ary force. There are syntactic marks of illocutionary forces only for 
nondeclarative sentences, such as those in the imperative or inter- 
rogative moods. But the ones that are of interest for my study are all 
in the indicative mood, and I am taking for granted the fact that the 
indicative mood is neutral as to which illocutionary force can di- 
rectly and literally be performed with it. 

So the incorporation of illocutionary force in a semantics of 
assertability conditions does not come from the hypothesis that illo- 
cutionary force comes into play in the very syntax of indicative sen- 
tences, but rather from the fact that I have a broader perspective 
concerning what is to count as belonging to the field of semantics. 
Without the suggestion that they are part of the very syntactic fea- 
tures of each grammatical mood, illocutionary force can be part of 
my semantical account as long as they play a functional role that is 
not directly dependent upon the meaning intentions of speakers. 

I also want to stress the fact that the locutionary conditions I 
alluded to do not involve any particular preference concerning their 
identification with truth conditions or verification procedures. As 
much as possible within the limits of this paper, I want to avoid 
having to make a choice between those two approaches. I said that 
the semantical approach I favor is one that spells out the assertabil- 
ity conditions of sentences, but this does not mean that the sen- 
tences in question do not have objective truth conditions. I just 
mean to suggest that truth conditions no longer constitute an essen- 
tial ingredient in meaning and that having meaning for any particu- 
lar sentence is not determined by the fact of having objective truth 
conditions. A semantics of assertability conditions could only be 
identified with a truth conditional semantics if we restricted the 
possible illocutionary forces to the assertive and if objective truth 
conditions were the only locutionary ingredient involved. But I pre- 
cisely want to avoid having to make such an identification. I allow 
for the possibility of there being many different illocutionary acts 
performed with the use of those sentences, and for the possibility 
that locutionary conditions be identified with verification proce- 
dures in certain cases at least. 

I 
As I already pointed out, I intend to concentrate on the locutionary 
conditions of use of propositional attitude sentences. A sentence 
such as 


(i) John believes that neutrinos have mass. 


can either be used to describe a particular atomic belief or to assert 
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the (general) existence of a belief that would satisfy certain descrip- 
tive features. According to the first reading of (i), it is asserted that 
Joba is in a certain belief relation with the sentence ‘Neutrinos have 
mass’; according to the second reading, a certain sentence is said to 
be believed by John and can be translated as ‘Neutrinos have mass’. 
This ambiguity between the two readings is semantical and affects 
the ‘that’ in (i). The presence of a that-clause in any given statement 
always produces a nominalization of the subordinate sentence and 
forms a complex name for a sentence, but this can take place in two 
radically different ways. When the whole sentence is used to report a 
particular atomic belief, the that-clause behaves a bit like a complex 
demonstrative expression or like the occurrence of a sentence 
within quotation marks. We could choose to describe it as a demon- 
strative expression or as a quotation, depending on whether we 
choose to describe belief as a relation toward an utterance or a 
sentence type. For my own part, I prefer the latter and shall there- 
fore describe this first use as reporting a belief toward a particular 
sentence type, but I could also adopt the other approach. If I had 
done so, the first use would look very much like Donald Davidson’s* 
paratactic theory of indirect speech. 

The second use I want to introduce also involves a nominalization 
of the subordinate clause but it ultimately must be contextually elim- 
inated in terms of a quantified statement. We proceed in this case as 
in Bertrand Russell’s theory of descriptions and eliminate the ex- 
pression ‘that neutrinos have mass’ as a pseudo proper name. This 
yields a general statement asserting that a certain sentence type is 
believed by John and is a translation of the sentence type ‘Neutrinos 
have mass’. 

The deep motivation for making such a distinction between 
atomic and general belief reports is that we want to allow for the 
possibility of reporting within the limits of our own language some- 
one else’s belief with his own understanding of linguistic meanings 
or his own principles of maximal rationality. This is what happens in 
the case of atomic belief reports when we simply want to describe 
the belief relation that an agent entertains with a particular sen- 
tence. It must be contrasted with the case where we assert the gen- 
eral existence of a belief and provide a translation for its content by 
exploiting the resources of our own conceptual scheme. The dis- 
tinction between the two reports is thus founded upon the possibil- 


*“On Saying That,” in his Inquiries tnto Truth and Interpretation (New 
York: Oxford, 1984), pp. 93-108. 
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ity of allowing for a relativity of conceptual schemes within a single 
language and, therefore, of making a distinction between language 
and theory. This is, of course, an antiholistic claim but I am pre- 
pared to make it. 

There are therefore two ways of representing (i). When we use the 
word ‘that’ as a name for the subordinate clause, we have 


(ii) John believes ‘Neutrinos have mass’. 


and when we use it as a pseudo proper name, we get a general 
formula like 


Gii) (Zp) [(believes (John, ‘p’)) A (p’ is translated as ‘Neutrinos have 
mass’)| 
O 
I shall return shortly to the motivations for choosing such a logical 
form. In particular, it will be instructive to give the motivation for 
introducing a substitutional rather than an objective quantifier in 
the analysis. For the moment, I want to mention a second ambiguity 
affecting this time the verb ‘believes’ in our original sentence. The 
problem is that, no matter what sentence is believed, we must distin- 
guish between its character, or linguistic meaning, and its content, 
which is identical either with its truth conditions or verification pro- 
cedures. The statement that counts as the object of belief can now 
be mentioned in order to specify either its character or its content. 
If we mention the sentence in order to specify its character, the 
whole belief sentence reports a belief relation between the agent 
and a certain linguistic meaning under a mode of presentation that 
is a certain verbal form. If we mention the subordinate clause in 
order to specify its content, the agent is then described as being in a 
relation with the content of the sentence mentioned and, in this 
case, the sentence no longer behaves as a mode of presentation. The 
ramification of sense into character and content is therefore respon- 
sible for an ambiguity affecting the attitudinal verb.* We can speak 
of two uses of those verbs. I shall speak of an intentional use of the 
verb in the case where the whole sentence relates an agent to a 
certain linguistic meaning via a sentence type, and speak of a mate- 
rial use of the verb when the whole sentence describes a relation 
between an agent and a certain content and the sentence used no 


longer serves the purpose of specifying any mode of presentation at 
all for the agent. 


4 Cf. Kaplan, ‘“Demonstratives,” ın J. Almog, J. Perry, H. Wettstein, eds, 
Themes on Kaplan (New York: Oxford, 1989), pp. 481-563. 
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On the basis of these two uses, propositional attitude sentences 
can be used to assert many different things. For example, we shall be 
tempted to make an intentional use of the verb ‘believes’ in order to 
report the actual content of an intentional belief, while the material 
use of the verb will ideally serve the purpose of reporting a func- 
tional teleological state of the agent. In the first case, the ascription 
of belief will be founded either upon sincere assent or dispositions 
to assent sincerely to a sentence. We could also include in the class 
of intentional beliefs those which are trivial logical consequences of 
sincere assents or of dispositions to assent sincerely to a sentence. In 
the case of a material attribution, we try to represent a certain func- 
tional state of the agent. It is as though the agent were described as 
trying to adapt himself to certain characteristics of the environment, 
whether these characteristics exist or not. As an example of this 
second use of the verb, we could mention the case of a dog barking 
at the bottom of a tree functionally behaving as though there were a 
cat in the tree. Another example of what I call a material belief is 
provided by the case of unconscious beliefs that the agent is trying 
to repress, perhaps without succeeding very well. We could also 
mention beliefs that are nontrivial logical consequences of an 
agent’s already existing set of beliefs. 

It is very hard to define very clearly these two uses. The distinc- 
tion serves to capture a difference between the true intentional be- 
liefs of the agent and those which he “‘in fact” has, whether he likes 
it or not and whether he knows it or not. We could, however, men- 
tion three features that can serve together as a criterion for discrimi- 
nating between the two uses. Two of those features have already 
been mentioned. First, the relatum is different depending on 
whether the use of the verb is intentional or material. In the first 
case, the relatum is a linguistic meaning under a certain verbal mode 
of presentation; in the second case, it is a certain content. Second, 
the nature of the belief relation is different whether we are dealing 
with intentional or material uses of the verb. In the case of an inten- 
tional use, we could talk about a relation of directly grasping or 
apprehending a certain intentional content; in the case of a material - 
use, we are dealing with a purely functional relation that does not 
involve grasping or apprehending any content whatsoever. Third, 
the two kinds of beliefs differ also by the fact that only intentional 
beliefs are reflexive and entail a first-person authority. If an agent 
intentionally believes that p, he represents himself as someone who 
believes that p and, as matter of fact, he knows that he believes that 
p. This is not the case for the material use. In the case of a material 
belief, the agent need not even have a reflexive representation of his 
own belief. 
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Now, since we have a second ambiguity, it follows that (i) can serve 
to describe at least four different possible states of affairs; (ji) and 
(iii) can be ysed to report intentional or material beliefs. We have 
first two distinct atomic belief reports, namely, 


(iv) John believes, ‘Neutrinos have mass’. 
and 
(v) John believes, ‘Neutrinos have mass’. 


where the subscripts indicate intentional and material beliefs. There 
are also two distinct general belief reports, namely, 


(vi) (Zp) [(believes, (John, ‘p’)) A (‘p’ is translated as ‘Neutrinos have 


mass’)] 
and 
(vii) (Zp) [(believes,, (John, ‘p’)) A (‘p’ is translated as ‘Neutrinos have 
mass’)] 


I would use (iv) if I were to report directly John’s intentional 
belief and wanted to use John’s own concepts of neutrinos and 
mass, i.e., the ones involved in his own linguistic group. He could 
have intended something different from an extremely light electri- 
cally neutral particle having a certain weight. And if, for instance, I 
had directly inferred from John’s calculating behavior a functional 
belief to the effect that neutrinos have mass without presupposing 
my own principles of rationality, I could have used (v). 

These reports are not meant necessarily to capture John’s idiolect 
but rather to allow for the presence of different dialects or different 
principles of rationality within a single linguistic community. Saying 
that (i) might be used to convey such different linguistic meanings or 
contents suggests that different conceptual schemes can come into 
play within the confinement of a single home language. It suggests 
that I can use a verbal form from my own dialect and still be able to 
express a very different conceptual scheme. Good examples of this 
take place when an English speaker is able to master the philosopbi- 
cal vocabulary of other English speaking philosophers having, views 
that are quite different from his own, or when she is a classicist but 
is able to make inferences within an intuitionistic logic. 

Of course, these differences can only make sense if the speaker 
and agent share a minimum linguistic vocabulary, and if within the 
limits of what I have called “linguistic meaning” there is a core 
shared both by the speaker and the agent. Rejecting a holistic pic- 
ture of language will enable me to allow for the possibility that two 
speakers sharing the same linguistic core could not only have a dif- 
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ferent truth theory but also a different ideology or different princi- 
ples of rationality. In short, I want accept the relativity of concep- 
tual schemes. 

In a general intentional belief report, such as the one described in 
(vi), the speaker makes an implicit use of a translation manual and 
tries to provide in his own dialect a ‘‘good”’ translation that captures 
the linguistic meaning of the sentence actually believed by the agent. 
For instance, a French scientist could have assented to the sentence 
‘Les neutrinos ont une masse’ and (vi) could be used to convey this 
information. In a general material belief report, such as the one 
described in (vii), he tries to provide a translation that captures in 
his own dialect the same content as the one that is materially be- 
lieved by the agent. For instance, I could have heard of an Italian 
scientist who believed that the things he had just located in the 
laboratory under Mont-Blanc have mass, not knowing that these 
things are neutrinos. It would then be wrong to say that he inten- 
tionally believed that neutrinos have mass, but correct to ascribe a 
material belief to that effect. 

In both cases, however, the speaker is making use of a principle of 
charity and is trying to maximize agreement with the agent. In such 
an intentional belief report, he is maximizing agreement concerning 
his understanding of linguistic meanings; in the case of a material 
report, he is maximizing agreement over content. In other words, 
he is either projecting onto the agent his own understanding of the 
linguistic meanings and presupposing that a good one-to-one trans- 
lation of those linguistic meanings is possible between the two dia- 
lects, or projecting the same functional relations toward content 
and presupposing that a good one-to-one translation could be made 


that preserves the content of the original sentence. 
m 


I should consider one last distinction: between reports that are used 
to represent basic beliefs, and those which are used to represent 
derived beliefs. Accordingly, the atomic intentional basic beliefs are 
going to be those which are ascribed on the basis of sincere assent to 
sentences (or of dispositions to a sincere assent) that do not presup- 
pose any inferential act on the part of the agent. The atomic inten- 
tional derived beliefs are those which are going to be inferred from 
atomic intentional basic beliefs of the agent. For instance, if the 
agent asserts or is in a disposition to assert ‘p’ and asserts or is in a 
disposition to assert ‘p materially implies q’, and if he does not infer 
these from any other previous beliefs, we have basic intentional 
atomic beliefs. But if the agent is in addition disposed to assent to ‘g’ 
on the basis of those previous assertions or dispositions, we are then 
dealing with derived intentional basic beliefs. In other words, the 
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atomic intentional derived beliefs are going to be those which pre- 
suppose basic intentional beliefs and a certain understanding of the 
linguistic meaning of expressions. It is important to point out, how- 
ever, that this understanding is the one presupposed to be on the 
part of the agent and does not necessarily coincide with our under- 
standing. 

The same kind of remarks apply mutadis mutandis to atomic mate- 
rial beliefs. There are those which are ascribed directly from the 
behavior of the agent and those which are ascribed on the basis of 
previous ones. For instance, I can directly infer from the behavior 
of a dog barking at the bottom of a tree the material basic atomic 
belief that the thing seen before is now in the tree. On this basis, I 
can then get a derived material atomic belief to the effect that there 
is a cat in the tree. The derived atomic material beliefs will include 
those which we ascribe while presupposing a certain inferential ca- 
pacity on the part of the agent. Again, the inferential capacity is not 
necessarily going to coincide with our own. 

These differences between basic and derived beliefs also occur in 
the case where the belief reports take the form of general quantified 
statements. The difference is that our own understanding of linguis- 
tic meaning, in the case of an intentional report, and our own infer- 
ential capacity, in the case of a material report, will be the things 
that we are going to presuppose in order to generate derived beliefs 
from those which are basic. 

I could multiply distinctions like these, but I am going to stop 
here. The three distinctions I have introduced will serve my pur- 
poses for the time being. I shall just mention, in passing, other 
distinctions that I chose to ignore in order to simplify my presenta- 
tion. I earlier said I would not discuss the different varieties of 
illocutionary acts that can be performed with propositional attitude 
sentences. I shall also leave aside problems related to the fact that 
we are using a third-person rather than a first-person psychological 
sentence. We would have to deal ultimately also with the distinction 
between de dicto and de re beliefs and with sentences involving 
iterated belief operators. 

Before ending this taxonomic presentation, I wish to draw your 
attention to another very important feature of our belief reports. I 
am thinking about the pragmatic presuppositions that come into 
play when we are making use of these sentences. For instance, when 
we are using a propositional attitude sentence to report an atomic 
intentional basic belief, we can presuppose that the agent has a 
complete understanding of the linguistic meaning of the sentence, 
but we could as well cancel this presupposition. It is not information 
contained in the literal meaning of the belief sentence. As I have 
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described them, intentional attitudes entail a relationship with lin- 
guistic meanings that are mediated by language. A certain verbal 
form serves the purpose of a mode of presentation. This is perfectly 
compatible with the possibility that the agent has an incomplete 
understanding of the sentence that behaves as the object of his 
attitude. Therefore, when we are taking for granted the fact that the 
agent has a complete understanding of the linguistic meaning at- 
tached to the sentence by the members of his own linguistic commu- 
nity, we are making a pragmatic presupposition that can by defini- 
tion be cancelled. When we are making such a pragmatic presuppo- 
sition, however, our intentional reports will involve an assumption 
of minimal subjective rationality on the part of the agent whether or 
not the rationality principles are similar to our own. 

By minimal rationality I mean a rationality that is based solely 
upon an understanding of the linguistic meanings attached to the 
logical connectives and quantifiers by the members of the commu- 
nity to which the agent belongs. Since I wish to promote a view of 
language that involves the idea of a general linguistic practice gov- 
erned by indeterminate semantical rules attached to the primitive 
expressions, i.e., semantical rules that do not anticipate all the appli- 
cations of the expressions, the rationality I am talking about is not 
holistic. 

As far as basic atomic material beliefs are concerned, we do not 
need to make such presuppositions. The reason is that the agent is 
only in a functional relation with the sentence. An agent can be in 
such a functional relation to a sentence even without being able to 
speak a language. The sentence mentioned in the belief report 
merely serves to describe the specific teleology involved in his func- 
tional states. We can pragmatically presuppose, however, that the 
agent has a certain ‘‘know-how,”’ i.e., a certain maximal inferential 
capacity. The degree of complexity of these maximal inferential ca- 
pacities will vary depending on whether we are presupposing the 
know-how of a community of chimpanzees, human beings, or artifi- 
cally intelligent devices. 

When we are ascribing material beliefs, we are often presuppos- 
ing a variety of degrees of subjective rationality on the part of the 
agent. But since this is also a pragmatic presupposition, it could be 
cancelled. The ascription of material atomic beliefs does not entail 
semantically a subjective rationality on the part of the agent. But if 
we were to make such a pragmatic presupposition, we could de- 
scribe the beliefs of the agent in a fashion that would rather fit well 
with a holistic picture. As it was mentioned earlier, my view remains 
quite opposite to a holistic account since I allow for intentional and 
material beliefs quite independently from any maximal rationality 


190 THE JOURNAL OF PHILOSOPHY 


assumption. But I do not wish to elaborate any further on my rejec- 
tion of holism since this would lead us far beyond the scope of the 
present paper. 

The two pragmatic presuppositions that I have described—the 
one concerning the knowledge that the speaker has of the linguistic 
meanings that are involved in our intentional reports, and the maxi- 
mal rationality assumption involved in our material reports—often 
accompany our belief ascriptions. It is perhaps worth noting that it 
is only because we are making these presuppositions that we are able 
to allow for derived beliefs. It is only because we are presupposing a 
complete understanding of the linguistic meanings that we can in 
the context of an intentional ascription allow for the substitution of 
synonyms or draw an inference based upon the linguistic meanings 
attached to the logical connectives and, therefore, make derived 
intentional ascriptions. It is only by presupposing a certain maximal 
inferential ability on the part of the agent that we can in the context 
of material belief ascriptions make derived material ascriptions, such 
as those which are logical consequences of basic material beliefs. 

Pragmatic presuppositions also come into play in our use of quan- 
tified statements that assert the general existence of intentional or 
material beliefs. I have chosen to represent this use by introducing a 
translation relation. This is because, in such a use, the subordinate 
clause is the result of a translation in the dialect of the speaker. 
When the speaker is using the sentence in this way and is attempting 
an intentional report, he very often presupposes his own under- 
standing of the linguistic meaning of expressions in the subordinate 
clause. Most of the time, speakers presuppose that they have a com- 
plete understanding of the words they are using but, of course, we 
cannot exclude the possibility of an incomplete understanding. It is 
once again a pragmatic presupposition that he may want to cancel. 

Once this pragmatic presupposition is made, however, the 
speaker may ascribe, if he wishes to do so, derived intentional beliefs 
that result, for instance, from the substitution of synonyms within 
his own dialect. He will very often want to substitute for the expres- 
sion that is a good translation of the one involved in the sentence 
believed by the agent. In these instances, the general intentional 
belief ascriptions are those which the agent would entertain if he 
were a member of the same linguistic community, in the same con- 
text and the same perspective as the speaker’s. If the agent had used 
the translation manual actually used by the speaker, the beliefs that 
are ascribed to him by the speaker would involve sentences that he 
would be disposed to accept. 

But the speaker will sometimes also want to ascribe derived beliefs 
resulting from the substitution of expressions that, relative to the 
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context and to the perspective of the speaker, determine the same 
content as the one that is materially entertained by the agent. And 
when we assert the existence of such and such a material belief in a 
sentence that can be translated in such and such a way, we very often 
presuppose as speakers a certain maximal inferential capacity. We 
presuppose that we have access to objective rational principles. As 
soon as we make such presuppositions, we are in a position to 
ascribe to the agent general material beliefs that are derived and not 
only basic. These new belief ascriptions are a consequence not only 
of a translation that is preserving the content of the original sen- 
tence in the dialect of the speaker, but also of projecting onto the 


agent the rationality principles the speaker is willing to accept. 
IV 


Many things could be said concerning the relationship between all 
those different locutionary conditions and the project of a sentential 
theory of propositional attitudes. But I shall raise only one main 
issue. I would like to claim that this very large variety of uses can be 
accounted for, not in spite of the fact that we are dealing with a 
sentential theory but rather precisely because we are using such a 
theory. 

This claim might sound surprising in light of the difficulties with 
which traditional quotational theories are confronted. Many authors 
have argued for quotational theories,® but they have for a very long 
time now fallen in disrepute. For one thing, they very often find 
themselves inextricably tied with an empiricist epistemology or with 
a nominalistic ontology, which gives them an unnecessary air of con- 
troversy. The truth is that a sentential theory can be formulated 
while preserving a relative neutrality regarding these issues. The 
major difficulties are rather of a logico-semantic nature. Philo- 
sophers® have been concerned about the fact that semantic para- 
doxes could be creeping through the back door. It has also been 
maintained that quotational theories in general would fail a transla- 
tion test.” Others have claimed that quotational theories will discour- 
age any attempt at constructing an epistemic logic. The charge is 
also very often made that, within the framework of a sentential 


5 E.g., Wittgenstein, op. ctt.; Rudolf Carnap, Monine one Necesen (Chicago: 
University Press, 1947); Davidson, op. cit.; Isra®l Scheffler, “An Inscriptional 
Approach to Indirect Quotation,” Analysis, x (1954). 83—90. 

E.g., Richard Montague and David Kaplan, “A Paradox Regained,” in Rich- 
mond Thomason, ed., Formal Philosophy: Selected Papers of Richard Monta- 
gus (New Haven: Yale, 1974), pp. 271-85; Thomason, ‘Indirect Discourse Is not 
Quotational,” The Monist, Lx 1977) 340-54. 

7 Cf. Alonzo Church, “On Carnap’s Analysis of Statements of Assertion and 
Belief,” Analysis, x (1950): 97-9 

8 Eg., oe Bealer, Quality and Concept (New York: Oxford, 1982). 
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theory, belief contents are too finely discriminated.’ At the same 
time, the theory makes it impossible to discriminate between homo- 
phonic beliefs that differ in content, whether this is because the 
sentence believed is ambiguous or because it belongs to two differ- 
ent languages (ibid.). There is also the difficulty of accounting for 
the fact that the content of a belief is not merely a pure verbal form, 
but rather something that has certain semantic properties, without 
reintroducing things that look very much like propositions.’ There 
is also, of course, the difficulty of accommodating de re attitudes 
within the framework.'’ On top of all of this, the theory is in any 
case confronted with even more obvious difficulties. It has an artifi- 
cial character and is unable to explain why we ascribe propositional 
attitudes to creatures that are devoid of any-language capacities. 

It is going to be useful to examine briefly the particular formal 
representation I wish to introduce for propositional attitude sen- 
tences. When a sentence such as (i) is used to assert the general 
“existence” of a belief that neutrinos have mass, it can be repre- 
sented as 


(viii) (Zp) [ (believes (John, ‘p’)) A (‘p’ is translated as ‘Neutrinos have 
mass’)] 


I leave aside the distinction between the material and intentional 

. This particular analysis reminds us a bit of the one that was 

once defended by Rudolf Carnap,’? but many very important differ- 
ences need to be mentioned. First, the quantifier is substitutional 
and not objectual. Second, the verb ‘believes’ must not be reduced 
to a disposition to assent to a sentence as prescribed by Carnap’s 
empiricism. This is because we allow for a material reading as well as 
an intentional one. It is only for the latter that we may find a dispo- 
sition to assent. Third, the substitutional variable ‘‘stands for” a 
sentence in use and is not an ambiguous name of sentences. The 
sentences are those which belong to the substitutional class and 
these semantically presuppose their semantical rules. Therefore, a 
substitutional formula cannot express anything meaningful unless 
its substitutional instances, i.e., the formulas that result from the 
substitution of the variable by an expression in the substitutional 


°? Cf. Max Cresswell, “‘Quotational Theories of Propositional Attitudes,” Jour- 
nal of Philosophical Logic, 1x (1980): 17-40 
cy op. cit.; Stephen Schiffer, Remnants of Meaning (Cambridge: 
MIT, 198 
1 Cf. W. V. Quine, ‘‘Quantifiers and Propositional Attitudes,” this JOURNAL, 
Lim, 5 (March 1, 1956): 177-87. 
12 Meaning and Necessity (Chicago: University Press, 1947). 
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class, are themselves meaningful. But the substitutional instances 
are going to be meaningful only if the subtituenda themselves, i.e., 
the sentences in the substitutional class corresponding to the vari- 
able, are already meaningful. A constraint of meaningfulness is 
therefore already registered in the substitutional classes and guaran- 
tees the meaningfulness of the whole substitutional formula. Now, 
since the sentences that occur within quotes in a substitutional for- 
mula semantically presuppose their semantical rules, it is not neces- 
sary to mention explicitly the fact that the sentence belongs to a 
language, as it is the case for Carnap. Finally, the relation of inten- 
sional isomorphism is replaced by a weaker relation of translation. 
These apparently subtle differences have, I believe, enormous con- 
sequences. 

It is very important to notice that any expression can belong to a 
substitutional class. Nothing prevents the left-hand parenthesis or a 
comma from appearing among the members of any given class. But 
if the substitutional instances of a formula like (viii) are to be syntac- 
tically well-formed, the substitutional class must contain full sen- 
tences only. In addition, a moment of reflection reveals that, in 
order to express truth conditions or verification procedures, (viii) 
must also be evaluated relatively to a substitutional class in which the 
sentences are already meaningful. There are two cases to consider. 
If (viii) is used in the intentional sense, it can express a content only 
if the substituenda have at least a linguistic meaning. If (viii) is used 
in the material sense, it can express a content only if the substi- 
tuenda themselves express either truth conditions or verification 
procedures. 

So it depends what we mean by ‘meaningful’. A substitutional 
formula not satisfying any such constraints can certainly be ‘mean- 
ingful’ if by that word we understand ‘having a linguistic meaning’. 
But is it meaningful in the sense of expressing truth conditions (or 
verification procedures)? A sentence has truth conditions only if it 
shows the conditions under which it is true. Now, a formula like (viii) 
would do that if it would show that it is true or that it is false under 
each of its substitutional instances. As an existential statement, it is 
true only if the disjunction of its substitutional instances is true. (A 
universal substitutional quantification will be true only if the con- 
junction of substitutional instances is true.) But the disjunction will 
be true only if each of the disjuncts have truth conditions. If (viii) is 
an intentional report, the disjuncts will express truth conditions 
only if the quoted sentences have at least a linguistic meaning. If it is 
a material report, it will express truth conditions only if the quoted 
sentences in the substitutional instances themselves express truth 
conditions. 


194 THE JOURNAL OF PHILOSOPHY 


It is very important not to confuse the substitutional quantifier 
with an objectual quantifier over verbal forms and their meanings. 
In such an objective quantified formula, sentences would come 
“wearing their meanings on their sleeves.” In a substitutional lan- 
guage, subtituenda semantically presuppose their meanings. These 
are therefore not asserted explicitly in the formula. Their existence 
only comes into play as a constraint that must be satisfied if the 
substitutional formula is to express a content. A sentence in use 
generally does presuppose semantically its semantical rules. So it is 
correct to say that substituenda do enter sentences with such seman- 
tic presuppositions. Once again, this is because their having mean- 
ing is a constraint on the very meaningfulness of the substitutional 
quantifier.’* 

These constraints of meaningfulness must be satisfied by the sub- 
stituenda even when they occur within the context of quotation 
marks as in (viii). This brings us to another important feature of 
substitutional quantification and it is the status of quotation marks 
in such a language. This peculiar feature is present not only when a 
substitutional variable occurs within quotes, but also whenever a 
particular lexical item does. Atomic belief reports, such as in (ii), 
(iv), and (v), must be seen as substitutional instances of the formula 
‘(Zp) (believes (John, ‘p’)’, and so whatever I have to say about the 
behavior of quotes in a general formula also applies to quotes in 
particular substitutional instances. Now, whenever a sentence oc- 
curs within quotes in a substitutional instance, it is as though it were 
simultaneously used and mentioned. We have already seen that the 
substituenda enter the substitutional instances while presupposing 
their semantical rules. We must now see that they behave a bit like 
arguments for the quotation function. When a quotation occurs in a 
substitutional formula, the only way to encode it is first to under- 
stand the sentence as used, and then take it as an argument for the 
quotation function. 

The choice to represent a propositional attitude sentence as a 
substitutional formula has the advantage of revealing the twofold 
nature of the sentences that are objects of belief, i.e., of being simul- 
taneously used and mentioned. In contradistinction with an objec- 
tual representation in-which the sentence is only mentioned, our 
own characterization circumvents some of the traditional difficul- 
ties. The sentences occurring within quotes presuppose their se- 


13 Cf. M. Sainsbury, “Russell on Constructions and Fictons,” Theoria, XLVI 
(1980): 19-36. 
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mantic interpretation and are therefore modes of presentations for 
them. As opposed to the Carnapian classical way of formulating the 
quotationalist view, the agent is not described as being in a relation 
with a pure verbal form relative to a language L, but rather with a 
linguistic meaning via a verbal form. If a change of vocabulary is 
required, I do not mind to call my theory a ‘‘quasiquotational 
theory.” The expressions ‘quotational’ or ‘sentential’ still seem to 
me to be accurate renditions, for it was never in the mind of the 
classical quotationalist philosophers to suggest that we believe verbal 
forms without contents. 

In (viii), the substitutional variable occurs in a quotational context 
and quotational contexts are very often described as the opaque 
contexts par excellence. Whether or not this is true, it is natural to 
allow substitutional quantification in these contexts. The result of 
putting a substitutional variable into quotes does not yield a name 
for the variable. It is rather more like a quotation function occur- 
ring all by itself. Quantifying substitutionally into a quotation con- 
text is simply binding the variable of the quotation function. Quotes 
are like descriptive functions that take objects as arguments and also 
have objects for values, unlike propositional functions whose values 
are truth values. They form a peculiar sort of descriptive function 
first, however, because the value of the function is in these cases 
precisely its argument. Another peculiar feature is that, contrary to 
the descriptive functions in Principia Mathematica, the quotation 
function is primitive and cannot be defined. 

It is very important not to confuse the substitutional quantifier 
with an objectual quantifier over expressions. A formula like (viii) 
does not assert the existence of an expression. Existence is not as- 
serted but presupposed semantically. The existence of expressions 
in the substitutional class is a condition that must be met for the 
formula to have any truth value whatsoever. The existence or non- 
existence of expressions in the substitutional class determines 
whether the formula has a truth value or not. If an objectual quanti- 
fier were used, an empty domain would yield falsity. By contrast, a 
substitutional formula like (viii), evaluated relatively to an empty 
substitutional class, has a truth-value gap. The formula is neither 
true nor false but remains meaningful. This does not imply, how- 
ever, that it is meaningful even in the absence of the concept of a 
language. The notion of a language understood in the sense of a 
system of syntactic and semantic rules is itself implied in the very 
notion of a substitutional class whether it is empty or not. Since 
attitudinal verbs are represented as ranging over substitutional 
classes, this means that we cannot make sense in belief reports of the 
notion of belief without the notion of a language. 
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Now, putting these two features together—the constraint of 
meaningfulness on the substituenda and the particular behavior of 
quotation marks—reveals yet another way to discriminate between 
the substitutional quantifier and an objective quantifier over ex- 
pressions. One could want to make the suggestion that a substitu- 
tional quantifier is nothing more than an objective quantifier over 
expressions that semantically presuppose their semantical rules. Ac- 
cording to this account, however, the believer is related to a pure 
verbal form and has no relation whatsoever with its meaning. In 
other words, we lose an important feature gained by the use of a 
substitutional representation and it is the fact that the quoted ex- 
pressions are used and not only mentioned. By ‘used’, I do not 
mean, of course, that what is quoted is a concrete inscription. It is 
more like Peter Strawson’s notion according to which a use is a 
sentence type in a context. The substituenda occurs in a quotation 
context as it would occur in any other linguistic context. For in- 
stance, no one will deny that the substituenda are “used” in substi- 
tutional instances, such as those for ‘(2p) (John believes that p)’. 
The result of replacing the variable by a sentence like ‘Neutrinos 
have mass’ yields the sentence ‘John believes that neutrinos have 
mass’, and no one will deny the fact that in the latter the subordinate 
clause is used. My point is simply that, when they occur within the 
context of quotation marks, substituenda are used in the same way 
except that they are at the same time ‘‘arguments’’ for the quotation 
function. 

This apparently subtle difference with an objective quantification 
over expressions is what justifies us in saying that, in a substitutional 
formula like (viii), the verbal form occurs as a mode of presentation 
for its meaning. Since the substituend is used.and presupposes its 
semantical rules even when it appears in the context of a quotation 
function, we can give a coberent representation of the idea that the 
verbal form behaves as a mode of presentation and this is precisely 
something that an objective representation cannot do. 

v 


I shall now try to show how the account solves many of the tradi- 
tional puzzles generally associated with a quotational approach. The 
theory enables us to account for the fact that we can ascribe beliefs 
to creatures that are unable to speak. Material ascriptions of beliefs, 
as we have seen, do not entail that the agent intentionally grasps any 
linguistic contents., Material ascriptions only serve the purpose of 
describing functional states of the agents and the linguistic compo- 
nent marks their specific teleology. 

We are also in a position to solve another well-known difficulty. 
As previously mentioned, the substitutional variables range over 
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classes containing expressions that presuppose their semantical 
rules. If these expressions are ambiguous or belong to different 
languages, they will presuppose different semantical rules. When 
the expression occurs in a given substitutional instance, the speaker 
will very often presuppose, in accordance with the literal meaning of 
the sentence, a particular lexical meaning. And this is how we are 
able to account for the differences between belief ascriptions made 
with a unique verbal form. The answer is again that the expression 
comes fully equipped with all its semantical rules and that we are 
presupposing only one. The fact that our ordinary belief ascriptions 
are sensitive to ambiguity does not tell anything against sentential 
theories. It simply reveals that speakers who are making belief 
ascriptions must make pragmatic presuppositions in order to 
achieve a specific claim. But this is not anything peculiar, since it 
concerns all fragments of natural languages and not only proposi- 
tional attitude sentences. 

Another classical objection loses its raison d’étre. This is the claim 
that, within the framework of a quotational theory, belief contents 
are too finely discriminated. There seems to be no way to allow for 
the substitution of synonyms. But as we have already seen, this is no 
longer true concerning the present approach. Since the substi- 
tuenda occur in substitutional instances with their associated seman- 
tical rules, the theory allows for uses made in accordance with the 
literal meaning in which it is presupposed that the speaker has ac- 
cess to the linguistic meaning of the sentence believed. The account 
I have just sketched indicates that it is only in the case of basic 
intentional atomic reports, in which the speaker does not’ presup- 
pose that the agent has full understanding of the sentence believed, 
that a homophonic rendering of the belief content is required and 
prevents any substitutions to be made. 

On the other hand, the theory provides at the same time neat 
solutions to philosophical puzzles, such as those discussed by Ben- 
son Mates, Tyler Burge, and Kripke’* concerning the apparent oc- 
casional failure of interchangeability between synonyms or between 
expressions that are good translations of each other. As a matter of 
fact, traditional sentential theories have always retained a certain 
plausibility precisely because they were able to explain why some- 
times even synonyms cannot be substituted with each other in belief 


14 Mates, “Synonymıty,” in Leonard Linsky, ed., Semantics and the Philosophy 
of Language (Chicago: Hlmois UP, 1953), pe pp- 111-36, Burge, “Belief and Synon- 
ymy,’ ' this JOURNAL, LXXV, 3 (March 2 119-38; Kripke, “A Puzzle about 
Belief,” in Avishai Margalit, ed., Meaning and Use (Dordrecht Reidel, 1979), 
pp. 239-88. 
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contexts and why there seems to be no criterion of synonymy good 
enough to block such puzzles. We can explain why some intentional 
belief reports seem to resist interchangeability of synonyms no mat- 
ter what the criterion is. In an intentional report, it is as though the 
verbal form were a mode of presentation for the linguistic meaning. 
I said that the substituenda occur with their associated semantical 
rules, but it must be emphasized that they do not ‘‘wear them on 
their sleeves” as they would if we had decided to quantify objectively 
over sentences and their associated meanings. Our account is not 
only compatible with the possibility of an incomplete understanding, 
it provides also an explanation for it. If the agent had direct access 
to concepts without the mediation of verbal forms, it would indeed 
be difficult to explain how someone could have an incomplete un- 
derstanding of concepts. 

Although I cannot show it in detail in the present paper, it is clear 
that the theory will enable us successfully to pass Alonzo Church’s 
translation test. As it was said above, there is no longer a need to 
make an explicit reference to the languages to which the quoted 
sentences belong. Quotations are not, under the present account, 
names of pure verbal forms (nor of verbal forms plus linguistic 
meaning), but rather sentences in use as substituenda within the 
scope of a quotation function. Good translations will therefore not 
require a homophonic translation of the quoted sentences. We are 
thus in a position to maintain a sentential account without having to 
face difficulties, such as those which Church saw in Carnap’s pro- 
posal. There are no longer any problems involved in allowing for the 
same transformations to take place in the quoted sentence of the 
definiens and in the subordinate clause of the definiendum. In other 
words, the definiens can be translated in another language without 
losing its status of definiens. Its translation in another language can 
express the same thing as what is expressed by the translation of the 
definiendum in the same language. 

Another traditional objection concerns the possibility of develop- 
ing an epistemic logic. According to the objector, this is not an 
option available to the quotationalist. The objection is that, in a 
sentential account, no inferential patterns can be allowed. But it 
now appears to be a wrong diagnosis. Within the framework of the 
present theory, it is even a bit anachronic. There is indeed a diffi- 
culty under the present account to provide a univocal answer to the 
question concerning the validity of the rules of introduction and 
elimination for the logical connectives in belief contexts. The reason 
is not, however, that the sentential account does not permit any 
inference to take place, but rather that there cannot be an absolute 
answer to the question. It all depends on the use being made of the 
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sentence. Are we using the sentence to report an atomic belief or to 
assert the general existence of a belief under a description? Are we 
making an intentional or a material use of the psychological verb? 
More specifically, are we presupposing a complete understanding 
on the part of the agent or are we not? If so, is it in the dialect of the 
agent or in the dialect of the speaker? Are we presupposing a maxi- 
mal inferential capacity on the part of the agent and, if so, is it a 
subjective rationality or an objective rationality? It should then be 
obvious that a question regarding the acceptance of a particular 
inferential pattern in epistemic logic cannot be answered unless we 
answer all these other questions before. 

Our theory provides a general picture in which everything ap- 
pears to be a matter of degree. There is a use in which no inference 
is allowed. This is when we make a third-person basic atomic inten- 
tional report and do not presuppose a complete understanding of 
the sentence believed on the part of the agent. At the other end of 
the spectrum, there is a use in which the speaker makes a derived 
material existential report while making use of his own rationality 
principles. This is the quasi solipsist stance occupied by a Davidson- 
ian radical interpreter. But in between those two extremes there is a 
vast array of possibilities for which our theory is able to account. 

Finally, the use of a substitutional quantifier prevents the resur- 
gence of paradoxes. The explanation goes this way. If substitutional 
classes may contain any expression, there appear to be strong con- 
straints on the substitutional classes corresponding to the substitu- 
tional formulas that are meaningful. In the present case, a proposi- 
tional attitude sentence with an mtentional reading requires for its 
meaningfulness substitutional classes containing expressions with at 
least a linguistic meaning. When a material reading is made, the 
requirement is even stronger since the sentences in the substitu- 
tional class must have a content, i.e., express truth conditions or 
verification procedures. Now, if the propositional attitude sentence 
occurs in its own substitutional class (or a sentence with a quantifier 
ranging over the same substitutional class or presupposing the exis- 
tence of such a class), it could fail to express determinate truth 
conditions or verification procedures. And as a matter of fact, it will 
not express any content whatsoever if it is a paradoxical sentence. 
So even if, in general, a substitutional formula can be self-referential 
without any danger, the ones that could become paradoxical will 
either fail to express a determinate content or else will satisfy con- 
straints similar to the ones imposed by ramified type theory. 

We are now in a position to see that there is sometimes something 
wrong in allowing that a formula be a member of its own substitu- 
tional class because, in these circumstances, we must conclude that it 


e 


200 THE JOURNAL OF PHILOSOPHY 


has truth conditions only if it is true or false, while a sentence can 
only be true or false if it already has truth conditions. For example, 
let us consider a material reading of (viii), adopt a truth conditional 
characterization of content, and wonder whether (viii) expresses 
truth conditions when it occurs as substituted in one of its substitu- 
tional instances. Since the question is precisely whether (viii) has 
truth conditions or not, it cannot be claimed that it is true or that it 
is false. And so the formula cannot tell whether it is true or false 
when the substituend is (viii) itself. But telling that it is true or that it 
is false under each of its substitutional instances was precisely the 
condition that had to be met in order to express truth conditions. 
The conclusion is that (viii) does not express truth conditions if it is 
allowed among the members of its own substitutional class. A mo- 
ment of reflection reveals that similar problems would reoccur if the 
substitutional class corresponding to (viii) were to contain, perhaps 
not (viii) itself, but a formula with the same substitutional class as the 
one corresponding to (viii). The class would contain a formula that 
appears in its own substitutional class and we would have the same 
problems all over again. And if the conclusion is that it lacks truth 
conditions, this will again deprive (vili) itself of expressing truth 
conditions. 

We can imagine a third case in which a substitutional class violates 
the meaningfulness requirement imposed above. Let us suppose 
that the class corresponding to (viii) contains neither (viii) itself nor 
a quantified formula with the same class, but a formula with a sub- 
stitutional class that contains either (viii) itself or a substituend with 
the same class as the one corresponding to (viii). This is also a case 
where (i) would not become true or false under each of its substitu- 
tional instances and thus would not express definite truth condi- 
tions. As a matter of fact, we could imagine an infinite list of in- 
stances of this third case. It can be generalized as saying that a 
formula like (viii) does not express truth conditions if it contains a 
formula that involves, somewhere along the line for its definition, a 
substitutional class that contains a formula with the same substitu- 
tional class as the one corresponding to (viii). 

The three cases examined correspond to Russell’s three formula- 
tions of the vicious-circle principle, which states, first, that an entity 
cannot become an argument for a given function if it cannot be 
defined without defining the function. In this first formulation, 
what is ruled out is the possibility for a function to become its own 
argument. In its second formulation, it is said that an entity cannot 
be the argument for a given function if it involves a reference to the 
domain of the function. This rules out the possibility of having 
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among its arguments a function that involves structurally a quantifi- 
cation over the same domain. In its third formulation, the principle 
states that an entity cannot be an argument for a given function if it 
presupposes that the domain of the function has already been de- 
fined and this rules out cases where the argument has (presupposes) 
a domain in which a function involves structurally a quantification 
over (or is defined by) the same domain. The conclusion is that 
substitutional formulas require very often for their meaningfulness 
substitutional classes of a determinate order. These classes must 
very often satisfy the constraints imposed by ramified type theory. 
Of course, there are numerous differences between a substitu- 
tional language and a language similar to the one developed in 
Principia Mathematica. Our substitutional formulas do not involve 
subscripts indicating a specific order. They are rather systematically 
ambiguous expressions. Secondly, they sometimes allow for self- 
reference to take place. We could nonproblematically treat a for- 
mula like ‘(Zp) (‘p’ is at T and at P and is a sentence)’ as meaningful 
and true even when the formula in question is the only one at T and 
at P. In other cases, ramification will be imposed only if we want to 
say that the sentence is meaningful (understood in the sense of ex- 
pressing truth conditions or verification procedures). This is what 
happens in the case where we report Epimenides’ belief that all 
Cretans are liars. The substitutional representation of such a belief 
report requires for its meaningfulness that the variable range over a 
substitutional class that does not contain the sentence itself and is 
well-ordered. Finally, in the case of truly paradoxical sentences, 
such as “The sentence written at T and P is false’, where the sen- 
tence in question is the only thing written at T and P, a substitu- 
tional representation does not block self-reference per se, but it 
enables us to show and it fails to express any content whatsoever. 
MICHEL SEYMOUR 
Université de Montréal 
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Freedom within Reason. SUSAN WOLF. New York: Oxford University 
Press, 1990. xii, 162 p. Cloth $24.95. 


Interest in perennial philosophical issues waxes and wanes for rea- 
sons it is not easy to fathom. At the moment, the topic of freedom is 
the focus of much attention. The rewards for those of us who are 
professionally committed to plow through this literature have been 
meager. When excessively detailed analyses and distinctions are of- 
fered as a substitute for insight and originality, the result is a clear 
and dreary product. There are alternatives; but they are often highly 
murky and elusive. 

Susan Wolf’s Freedom within Reason is a refreshing exception. 
Bold positions are advanced, elaborated with adequate, not exorbi- 
tant detail, and forcefully defended. The writing is crisp and not 
ponderous. The result is achieved in part through the virtual ab- 
sence of references to current literature. Wolf’s command of this 
literature permeates the work, but obliquely. Some would prefer 
more explicit engagement with the debates of the day; but the clarity 
and directness with which Wolf develops the line of her argument 
would then have been sacrificed. 

Most significantly, Wolf’s perspective on the issues is original. 
Although the theories that are surveyed are familiar, the way in 
which they are sorted is illuminating. Instead of the familiar classifi- 
cations, Wolf offers a trichotomy: the autonomy view, the real-self 
view, and the reason view. The reason view, she contends, provides 
the best conception of the sorts of beings we must be if we are to be 
responsible for the results of our wills. 

The asymmetric conception of responsibility she earlier intro- 
duced into the literature and here develops is also original. A re- 
sponsible agent is one who has the ability to respond to the true and 
the good. A moral compulsive, one who is unable to do the wrong 
thing, is responsible if she is acting out of her perception of the 
rightness of her action. So in this case, determination is compatible 
with responsibility; but one who is (psychologically) determined to 
do the wrong thing lacks the ability to do otherwise and is, there- 
fore, blameless. (For reasons I shall soon examine, physical determi- 
nation may leave intact the ability to do otherwise.) 

This conception, Wolf contends, permits a compromise stance on 
the debate between those who see the free-will problem as basically 
metaphysical and those who view it as essentially moral. She convinc- 
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ingly rejects the antimetaphysical view identified with Peter Straw- 
son. (Since a bona fide ability—to understand and act in accordance 
with the right—grounds the moral legitimacy of praise and blame, 
responsibility is a factual matter.) But valuational considerations 
enter directly into the nature of this ability: our interest in the con- 
ditions of praise and blame is an interest in knowing if someone is 
able to be moved to act in the right way. If, guided by the thought 
that morally neutral actions can be performed by a responsible 
agent, we ignore the valuational element and construe responsibility 
as some sort of robust, independent self-activity, we will not be able 
to explain why such agents deserve credit or merit blame. 

Moral responsibility is a moral notion. And Wolf is certainly right 
that the link between freedom and responsibility implicates the no- 
tion of the ability to do the right thing. But that we are interested in 
freedom because it is relevant to the assignment of praise and blame 
does not imply that freedom itself must be characterized in value-la- 
den terms. A doctor is interested in sickness because he wants to 
treat sick people. There is a conceptual link between sickness and 
treatment—only sick people should be treated. But not all sick peo- 
ple are treated and, more significantly, the concept of illness may 
well be characterizable independently of the concept of treatment. 
Analogously, even though only free actions deserve to be praised or 
blamed, not all free actions merit such responses. So freedom as 
such need not be defined in terms of moral concepts. If I owe 
Sam $10, my ability to do the right thing is just my ability to give 
Sam $10. 

The defects of the autonomy view are familiar. Either my choice is 
determined, in which case J am not autonomous, or my choice has 
its origin in nothing, in which case it is arbitrary and in no clear 
sense an expression of me. 

The real-self view is the compatibilist idea that, even though I am 
sometimes unfree when I am doing what I most want to do, there is 
a way of distinguishing those cases from cases of genuine freedom 
and responsibility in which the desires that are expressed represent 
the “real” self. The basic problem with this account is that the real 
self may be ill and, therefore, not responsible. 

On what conception of responsibility can we say this? If we are 
claiming that the mentally ill person is not responsible for his self, 
we cannot substitute the autonomy conception for the real-self con- 
ception, for the autonomy conception is incoherent. 

The reason view absolves a person who is ill because the person is 
unable to guide his action by the dictates of reason, where reason is 
the faculty that is inclined to lead us to the true and the good, if 
there are such things. Since coercion is action in which I am pre- 
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vented from taking into account some relevant consideration, rea- 
son, characterized in this normative way, cannot coerce. 

But is not my freedom diminished just a little if I do not autono- 
mously choose rationality? Even if I have taken all factors into ac- 
count and A is the rational choice, might I not be forced by reason 
to choose A? 

If freedom requires that I retain the power to reject reason, I 
shall never have a reason to exercise this power (for if I do, the 
choice is rational and J am not rejecting reason). Yet even if Wolf is 
right, we do in fact seem to have this power and, strange as this 
power may be, it does not yet follow that its possession is not a 
necessary condition of the sort of freedom that grounds moral re- 
sponsibility. Furthermore, Joel Feinberg’ has observed that one may 
just delight in the realization that there are more options than one 
will ever want to exercise, or one may simply believe that liberty is a 
supreme value irreducible to its extrinsic connection to other values 
its possession may promote. (I shall soon examine Wolf’s likely re- 
buttal according to which the value of liberty just is its connection to 
the rational or the promotion of the right.) 

But if I can have a reason to possess a power I shall never have a 
reason to exercise, are we not making a crucial concession to the 
reason view? No, for I am not rejecting reason. I might simply be- 
lieve that freedom is an independent value, even a greater value 
than reason, which is enhanced by the power ta reject reason, even 
though the value of freedom depends upon a rational defense. A 
hedonist believes that pleasure is greater than reason, yet may con- 
cede that he must defend that position rationally. He might also take 
it as evident that the life of a person who derives pleasure from the 
knowledge that he can reject reason is enhanced by the possession 
of that very power. 

Yet Wolf strengthens her case by a further consideration. Since 
the concern of the book is the discovery of the conditions of the 
desert of attributions of responsibility, if we concede to this current 
concern for autonomy, it will follow that an autonomous rational 
agent is more (free and) praiseworthy than a nonautonomous 
rational agent. But this cannot be, for there is no difference be- 
tween the two agents in terms of all morally relevant considerations. 
Each took into account all the factors bearing on the rightness or 
wrongness of the act. The “freedom” of the autonomous agent is 
morally vacuous. True freedom (or at least the freedom that is neces- 
sary for moral responsibility) is the freedom to exercise and act in 


1 “The Interest in Liberty on the Scales,” Rights, ais and the Bounds of 
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accordance with reason. A good person is deeply praiseworthy for 
his actions because his moral training has led him to see and to want 
to conform himself to the right. Similarly, a musician deserves credit 
for her performance because she aligns her abilities with the aesthet- 
ically relevant qualities of the performance. 

Wolf is concerned to deflect criticism from those who interpret 
her as conflating freedom with goodness on behalf of a puritanical 
ethic. First, there are values other than moral ones and a rational 
agent should, therefore, appreciate reasons to promote nonmoral 
values perhaps at the expense of moral ones. Second, it is highly 
plausible that there are normative controversies that cannot be 
rationally adjudicated even by omniscient agents. If this second doc- 
trine (“normative pluralism”) is true, reason does not completely 
constrain values and two completely rational agents can in certain 
cases freely and responsibly advance conflicting goals. 

Accordingly, once we abandon a narrow conception of reason, we 
will not be worried about the image of a person submitting grudg- 
ingly to the dictates of morality; for if his grudge has a rational basis, 
it will have a legitimate claim upon his attention. Reason, with its 
multiple sources, may recommend selfishness or adventure or the 
temporary suspension of morality. 

Thus, Wolf supposes, the objection that true freedom is captured 
better by the real-self view may be put to rest. The charge that “our 
interest in freedom is an interest in acting according to our own 
reasons and not, as the Reason View suggests, an interest in acting 
by right reasons” (139) depends upon a failure to appreciate the 
variegated nature of reasons and the plausibility of normative plu- 
ralism. 

Consider a test case. Alicia has received an admirable upbringing. 
She has internalized the values she has been taught through an open 
process of questioning and rational discussion. Her values track the 
good to the extent that this is objectively possible. Alicia finds to her 
dismay that she is developing powerful impulses to abandon the 
moral and nonmoral values she acknowledges to be correct. She - 
finds herself identifying with these selfish impulses, while at the 
same time condemning the error of her potential ways. She has been 
taught that it is sometimes acceptable to act selfishly and to devote 
some of her time and energy to the promotion of her own talents 
and interests even when this time and energy could have been used 
to serve altruistic enterprises. But her selfish interests in this case 
are not benign; they involve gratuitous harm to others and have no 
redeeming quality other than Alicia’s personal gratification. Alicia 
recognizes that her personal gratification is a reason to satisfy her 
selfish desires; but she also sees that it is overridden by the harm she 


206 THE JOURNAL OF PHILOSOPHY 


would inflict on others. Although her struggle is a constant one, ‘in 
the end she does not veer from the path of duty. She acts rightly and 
responsibly, but feels that life is passing her by as she presses on to 
do her duty at the expense of what matters most to her. She is 
further frustrated by the realization that she is doing what is overall 
the best thing to do—for in her special circumstances that does not 
coincide with the most preferred course of action. 

So Alicia is moved by the right because it is right, but is neither 
content and perhaps may not even be able to act otherwise. She is, 
in a sense, a good person and may well be morally praiseworthy for 
her actions. But why say she is free? Alicia is unable to express her 
deepest self, the aspect of her being she would, if she could, display 
to the world as the manifestation of her individuality. (Wolf con- 
cedes the importance of these nonrational determinants of values 
and decisions as giving “us our individuality and distinctiveness” 
(146).) She is frustrated and ambivalent. Her reason tells her that 
overall she is behaving rationally; her appetitive nature, with which 
she identifies, could not care less. 

We cannot just assume that reason will find a way out of this 
dilemma. To be sure, Alicia might seek psychotherapy and might 
discover that the elimination of her dark impulses will lead to a 
happier life. She might then have a reason to change. Or she might 
learn that the satisfaction of her seedy nature has significance suffi- 
cient to excuse at least minor harm to others. But then again she 
might not make these discoveries. 

When Wolf concedes that nonrational determinants “at the limits 
... can be in tension with our freedom and responsibility” (146), 
she appears to be addressing the limitations upon our openminded- 


-y ness resulting from contingencies of environment, religion, race, 


and the like, rather than the conflicts I am worried about in which 
reason is not impaired by one’s nonrational nature, but is rather felt 
to be a force alien to the self. Now I am not defending the real-self 
view’s conception of responsibility, for I agree that someone might 
be deeply insane in a way rendering him morally blameless. Yet the 
real self’s conception of freedom is not thereby undermined and 
remains a serious competitor to Wolf’s. To be sure, it is not prob- 
lem-free. Wolf’s reference to insanity is well-taken. A minimal level 
of rationality and personal integrity is required for any self to be 
free or responsible. In fact, if this point is conceded to Wolf—and 
there is no doubt that it is an important concession—the difference 
between the real-self and the reason views becomes very narrow. 
Remember that (1) the reason view does not assert the objectivity of 
reason—there may be no truth or good to discover, (2) the doctrine 
of normative pluralism maintains that rational adjudication of dis- 
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putes is not always even theoretically possible—from a rational 
point of view, deep cultural impasses, for example, may have tò, 
“endure; (3) reason itself imposes no limits on the sources of value or 
the sorts of reasons there are. In her effort to minimize the Platonic 
sounding character of her doctrine, Wolf has rendered the responsi- 
ble agent as basically the nonirrational agent and we can all agree 
that irrationality and moral responsibility are incompatible. 

I found Wolf’s views on the connection of these ideas to deter- 
minism problematic. We are told that Rose, realizing that she should 
grade her ethics papers, instead succumbs to temptation and 
watches The Philadelphia Story for the seventh time. If her deci- 
sion is determined by her desire to see Cary Grant again (no doubt 
just a part of the sufficient condition), she is unfree and cannot be 
blamed for her decision. A necessary condition of the ability to do 
otherwise is psychological indetermination, defined in terms of the 
consequent, not the antecedent factors. For example, a bleeding 
ulcer physically causes the rupture of the stomach lining, but psy- 
chologically causes a decision to seek medical help. (Wolf offers no 
account of the physical-psychological distinction, relying instead on 
our intuitions.) A free, psychologically undetermined action might 
well be physically determined. More precisely, a bodily movement 
which is physically necessary (in virtue of the appropriate antecedent 
conditions and laws) might well be the bodily movement involved in 
an action the agent had the ability to prevent and for which the ~ 
agent may be held responsible. We cannot automatically suppose 
that a physical sufficient condition impedes the agent from per- 
forming an alternative action. 

To a certain extent, this line of thought is part of the standard 
compatibilist line. Compatibilists claim that nomic necessity is dif- 
ferent from inability, so that determination does not automatically 
preclude the ability to do otherwise insofar as ability is relevant to 
freedom and responsibility. What is distinctive in Wolf’s account is 
her insistence that psychological determination is to be treated as 
more devastating than physical determination. Most compatibilists 
suppose that they will have a hard time selling their doctrine to 
those who believe that the most likely sort of determination in the 
sphere of human behavior is physical or physiological. Conse- 
quently, we are often requested to reflect on the implications of 
psychological determination because these are intuitively the most 
benign. How can Rose’s desire render her incapacitated in relation 
to paper grading? Can a desire to watch Cary Grant be likened to 
blindness or narcolepsy? In Wolf’s zeal to make psychological deter- 
minism as fundamental as far as we know as physical determinism, 
she seems to me to go overboard. She rejects both a strong compati- 
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bilist position according to which neither physical nor psychological 
determination of action per se precludes the ability to do otherwise 
and a modified position according to which only physical determina- 
tion negates freedom. The latter view asserts that freedom is a real- 
ity if our behavior is subsumed under certain sorts of laws, if, per- 
haps, folk, intentionalistic, or certain forms of cognitive psychology 
win the day, and is an illusion if the ultimate laws of human behavior 
are physiological. Wolf’s view, on the contrary, is that only psycho- 
logical determination automatically rules out freedom and it is en- 
tirely unclear why and strongly counterintuitive that psychological 
determination is singled out in this way. 

A clue to the reasons for Wolf’s adoption of this curious opinion 
might be found in the examples of psychological determination pro- 
vided: “Rose could have been coerced or even hypnotized into 
watching TV, or obsessed with Cary Grant. Alternatively, she could 
have a pathological aversion to grading papers that she is helpless to 
overcome except in the face of, say, death threats from her students 
or the imminent prospect of losing her job” (105). To be sure, if all 
psychological explanations are like these, Wolf is right. And given 
that Wolf says that “the thesis of psychological determinism has 
virtually nothing to support it” (101), we have some reason to be- 
lieve that she is using these examples as models and drawing the 
warranted conclusion that not all actions are determined in these 
sorts of ways. But no consideration is given of the possibility of 
deterministic psychological theories that govern ordinary as well as 
compelled behavior. Since Wolf does not criticize the notion of a 
psychological law itself, as Donald Davidson does, I do not see why 
there cannot be such a law entering into a deterministic account of 
ordinary, noncompelled behavior. 

As reviewers are wont to do, I have concentrated upon what I take 
to be key problems in the position. Others will, I am sure, show 
concern about different matters: for example, Wolf’s belief that an 
action’s being determined by a perception of its rightness is compati- 
ble with complete physical determinism is worth closer examination; 
her conception of rationality as the faculty most likely to lead us to 
the true and the good will certainly be challenged. There is enough 
controversy for everybody. 

I would hope that Wolf will address these concerns, for it is im- 
portant to know if the rationality conception of responsibility is 
ultimately viable. I know of no recent work that has taken us so far 
in the direction of establishing this result. 

BERNARD BEROFSKY 
Columbia University 
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The editors regret the untimely death of Hector-Neri Castañeda, the Mah- 
lon Powell Professor of Philosophy at Indiana University/Bloomington on 
September 7, 1991, at the age of 66. Born in the village of San Vicente in 
Guatemala, Professor Castafieda received his degrees at the Univ. of Minne- 
sota before returning to Guatemala to teach for a year. Having studied 
thereafter on a fellowship at Oxford, he visited for a year at Duke before he 
accepted a position at Wayne State Univ. While there Professor Castañeda 
founded the journal Notis, of which he remained the editor until the time 
of his death. In 1969 he moved to Indiana Univ., where he served as Dean 
of Latino Affairs from 1978 to 1981. Through the years he held a number 
of visiting appointments, directed six summer and one year-long NEH insti- 
tutes, and was the recipient of a Guggenheim Fellowship. During 1979-80 
he served as President of the Central Division of the APA. In 1990 he was 
elected to the American Academy of Arts and Sciences. Very shortly before 
his death he received Guatemala’s highest honor, the Order del Quetzal. 


In honor of Hector-Neri Castañeda, the Department of Philosophy at In- 
diana University has named one of its regular annual fellowships after him. 
In addition, a fund has been started at the Indiana Univ. Foundation. Con- 
tributions will help support Guatemalans’ undergraduate or graduate study 
of philosophy at Indiana Univ. or at a university in Guatemala. 


The editors also regret the untimely death of Gregory Vlastos, Professor 
Emeritus at both Princeton University and the University of California/ 
Berkeley, on Saturday, October 12, 1991. Professor Vlastos was born in 
Istanbul, known then as Constantinople, coming to the United States to 
study at the age of 18. He earned a doctorate of divinity at Chicago Theolog- 
ical Seminary and a doctorate in philosophy at Harvard. Professor Vlastos 
taught at Queen’s College in Kingston, Ontario, from 1931 to 1948, when 
he went to Cornell. After seven years there, he went to Princeton, where he 
became chairman of the Phil. Dept. Upon retiring from Princeton in 1976, 
he moved to Berkeley and taught there for more than a decade. During his 
career, Professor Vlastos wrote several texts and edited more than 50 
books. He served as President of the Eastern Division of the APA. Among 
his numerous awards and honors, he won two Guggenheim Fellowships and 
was awarded the MacArthur Foundation fellowship in 1990. 


In honor of Gregory Vlastos, a conference on the theme ‘‘Socratic Studies” 
will be held at the University of California/Berkeley on May 1-3. During 
the weekend there will also be a memorial service. The conference will 
include five sessions: (1) Socratic Method; (2) Virtue; (3) Political Theory; 
(4) Love and Sexuality; (5) The Theory of Forms. The speakers will be Julia 
Annas, Alan Code, Gail Fine, David Halperin, Terence Irwin, Richard 
Kraut, Jonathan Lear, Robert Meister, Alexander Nehamas, Martha Nyss- 
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baum, Gerasimos Santas, and Bernard Williams. Those interested in attend- 
ing the memorial service and conference may contact Robert Meister, Poli- 
tics Dept., Kresge College, Univ. of California, Santa Cruz, CA 95064. 


The Department of Philosophy at the University of California/Riverside is 
pleased to announce the appointment of Jonathan Wilwerding as Assistant 
Professor, Mark Ravizza as Assistant Professor, Genoveva Marti as Assistant 
Professor, and H Carl Hoefer as Assistant Professor. 


The Department of Philosophy at Emory University is pleased to announce 
the appointment of David Carr as Professor. 


The Department of Philosophy at Ohio State University is pleased to an- 
nounce the appointments of Neil Tennant and Mark Wilson as Professors, 
William Taschek and Kathleen Cook as Assistant Professors, and Anna 
Greco as Instructor. 


The Department of Philosophy at Purdue University is pleased to announce 
the appointment of Leonard Harris as Professor of Philosophy and Direc- 
tor of the Afro-American Studies and Research Center, and Martin J. Ma- 
tustik as Assistant Professor of Philosophy. 
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POINTERS TO TRUTH* 


C onsider the following: 


Jack: What I am saying at this very moment is nonsense 
Jil: Yes, what you have just said is nonsense. 


Apparently, Jack spoke nonsense and Jill spoke to the point. But 
Jack and Jill seem to have asserted the same thing: that Jack spoke 
nonsense. Wherefore the difference? 

To avoid the vagueness and the conflicting intuitions that go with 
‘nonsense’, let us replace it by ‘not true’ and recast the puzzle as 
follows: 


line 1: The sentence on line 1 is not true. 
lme 2: The sentence on line 1 is not true. 


If we try to evaluate the sentence on line 1 we find ourselves going 
in an unending cycle. For this reason alone we may conclude that it 
is not a true sentence. Moreover, we are driven to this conclusion by 
an elementary reasoning: if the sentence on line 1 is true then what 
it asserts is true, but what it asserts is that the sentence on line 1 is 
not true. Consequently, that sentence is not true. But when we write 
this true conclusion on line 2 we find ourselves repeating the very 
same sentence. It seems that we are unable to deny the truth of the 
sentence on line 1 without asserting it at the same time. 


a el ee basic ideas were presented in October 1985, at the Bar-Hillel 
memorial conference held at Boston University. Further developments were re- 

rted in various lectures and invited talks at: Harvard, Princeton, Stanford, 

CLA, UC Irvine, CSLI, the Pacific APA meeting, the University of Texas/Aus- 
tin, and the 1991 ASL meeting. The technical core was presented in the 1988 
TARK meeting and published in the proceedings (cf. fn. 3). I have benefited from 
many reactions. From the many to whom thanks are due I shall only mention 
David Kaplan, Rohit Parikh (who found a bug in one of the earlier proofs), 
Charles Parsons, Hilary Putnam, and Brian Skyrms. 
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Essentially, the puzzle is a variant of what Bas van Fraassen’! has 
named “‘the strengthened liar’”—called by some ‘‘the metaliar’’— 
which I shall simplify here to the strong liar. We might recall at this 
point that the usual version of the liar paradox consists in the fact 
that the sentence on line 1, or some analog of it, is neither true nor 
false, because either assumption leads to contradiction. That version 
can be resolved by admitting gaps, i.e., by giving up the rule that 
every sentence must be either true or false. The present paradox is 
an altogether different kettle of fish. It consists in our seeming in- 
ability to express a true conclusion—namely, that a certain sentence 
is not true—without repeating the very same sentence whose truth 
we deny, or one equivalent to it. 

It is known that the semantic paradoxes can be reconstructed in 
various modal frameworks (cf. Richard Montague’). Indeed, using a 
knowledge predicate we get an analogous puzzle: 


line 1: What is written on line 1 is not known by Jill to be true. 


Regarding the sentence, Jill concludes that, indeed, she does not 
know it to be true. (If she knows it to be true, then it is true; but, 
given what the sentence says, this implies that she does not know it 
to be true, hence—a contradiction.) Jill writes her conclusion on 
line 2: 


line 2: What is written on line 1 is not known by Jill to be true. 


So Jill knows the truth of what is written on line 2, because she has 
just deduced it, but not of what is written on line 1. A similar puzzle 
can be made with ‘known’ replaced by ‘believed’. With ‘necessary’ 
we get the following version: 


line 1: The sentence on line 1 is not necessarily true. 


line 2: The sentence on line 1 is not necessarily true. 


Again, the line-1] sentence is not necessarily true; because if it were, 
it would not be true and, a fortiori, not necessarily true. This con- 
clusion ts necessarily true, because we have just proved it. Writing it 
on line 2, we get a necessarily true sentence, which merely repeats 
the not necessarily true sentence on line 1. 

The moral of all these puzzles is simple. In situations of this na- 
ture we should assign truth values not to sentence types but to their 


le ” 
eri tery Implication, and Self-reference,” this JOURNAL, LXV, 5 (March 
7, 1968): 1 2. 
* “Syntactical Treatments of Modality, with Corollaries on Reflection Princ 
and Finite Axiomatizability,” Acta Philosophica Fennica, xvi (1963): 153-67. 
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tokens. Having concluded that the line-] sentence is not true, we 
state the conclusion by displaying another token of the very same 
sentence. The second token, on line 2, expresses something alto- 
gether different from what is expressed, if anything, by the token on 
line 1. The people who agreed with Jill in the Jack and Jill exchange 
had no difficulty in grasping this point. 

In this respect, modal predicates like ‘know’, ‘believe’, and ‘neces- 
sary’ are in the same boat with ‘true’ and the same remedy is re- 
quired, namely, to allow different tokens of the same sentence to 
have different meanings, or—if you want—to express different 
statements, or different propositions. One should expect the for- 
malisms to differ according to the predicate in question, but the 
same general framework will underlie them. In the present work, I 
set up the formalism for truth, thereby providing also the basic 
framework for the various modalities. 

Following the intuition just given, we can see that in general what 
a token expresses depends on (1) what it says, i.e., on the sentence 
type and (2) on the whole network: on the tokens to which the 
sentence refers and on the tokens to which they in their turn refer, 
etc. This is what distinguishes the self-referential sentence token on 
line 1 from its non-self-referential brother on line 2. 

I shall propose a formal setup for specifying networks of tokens 
and an algorithm for assigning truth values. It assigns the sentence 
token on line 1 the value GAP, that on line 2 the value T (True), and 
it yields the same kind of intuitive results in general. 

The following “Preview” contains a rather nontechnical presenta- 
tion of the method, with an illustrative example. It also contains 
philosophical points and clarifications concerning the proposal’s sig- 
nificance, some of which are elaborated in later sections. In section 
TI—where the concept of a black hole is defined—a wider perspec- 
tive is given, which clarifies the proposal’s place and motives, within 
the frameworks of natural-language semantics, and in the philo- 
sophy of mathematics. Section III is devoted to the issues of indexi- 
cality, levels of truth, and the import of a non-Tarskian semantics. 
Section IV discusses some points concerning pointers and proposi- 
tions, and comments on some aspects of the Barwise-Etchemendy 
work. In section V, previous accounts of the strong liar are dis- 


cussed: those of Charles Parsons, Tyler Burge, and Brian Skyrms. 
I PREVIEW 


I base the formalism on three truth values T, F, and GAP. I shall 
refer to the first two as the standard truth values. The third value, 
GAP, signifies more than mere absence of a standard value. It signi- 
fies recognized failure. The idea is that we assign tokens, or—as we 
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shall later see—pointers, the value GAP when, in the course of the 
evaluation, we conclude that they must fail, i.e., that the rules will 
not enable us to assign them a standard value. We can then go on 
and use other, “uninfected” tokens to assert of a previously failed 
token that it is not true, or not false, or that it is neither true 
nor false. 

Those acquainted with Kleene’s three-valued logic will be quick to 
match GAP with Kleene’s u, which stands for ‘undefined’. GAP is, 
however, an active value, not a mere “undefined”; an assignment of 
it can serve as a basis for further assignments of T, or of F. By 
making GAP an active value we can construct on top of the gap 
instead falling into it. 

Now, the concept of a token is too narrow for the purpose of a 
general framework. For we might want to refer to sentences without 
having them displayed somewhere as tokens. Suppose that there is 
just one sentence on line n, then we can use ‘the negation of the 
sentence on line n’ in order to refer to the negation of that sen- 
tence, independently of there being a token of that negation. And, 
in general, we might speak about the sentence obtained from that 
on line n by any syntactic manipulation, e.g., the substitution of 
some term by another. Therefore, I use a more general concept, 
that of a pointer. A pointer is any object that ts used to point to some 
object. A token constitutes a special kind of pointer, it points to the 
type of which it ts a token. 

Pointing is determined by a variety of conventions. Being a 
pointer is not something intrinsic in an object, but amounts solely to 
its functioning in a certain role. Any object can serve. If desired, we 
can regard descriptions as pointers to the described objects. And 
one can set up a formalism whereby every sentence type is a pointer 
to itself. The solution to the semantic paradoxes requires, however, 
that we have many pointers to the same sentence type, so that when 
a pointer fails another is available. Any object can be pointed to. But 
since in this paper only pointers to sentence types are needed, I shall 
reserve ‘pointer’ for this case, unless the context indicates oth- 
erwise. 

I find it is useful not to enter, at least at this stage, into the nature 
of pointers, but to take them as primitives. Thus, I assume a lan- 
guage, a set of objects called “pointers,” which have names in that 
language, and some fixed arbitrary function that correlates with 
each pointer a sentence (of that very same language) to which it 
points. A pointer whose name occurs in some sentence can point to 
any sentence whatsoever (including the sentence containing its 
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name). All possibilities of direct or indirect self-reference are thus 
covered. 

Only the following additional structural elements are needed for 
the system. We have to correlate, with each pointer to a compound 
sentence, pointers that point to its sentential components. Thus, for 
any pointer p pointing to A*B, where * is a sentential connective, 
there are pointers pl and p2, which point, respectively, to A and to 
B. And if p points to the negation TA, then p1 points to A (and we 
can, in this case, stipulate that p2 = p1). I call p1 and p2 the derived 
pointers of p. If the language has quantifiers, we also correlate with 
every p that points to VvA(v), or to avA(v), and with every constant 
term t, a derived pointer p|t, which points to A(t) (the result of 
substituting ¢ for v in A(v)). 

Note that, if p is a token, then p1 and p2 can be simply identified 
with the corresponding subtokens. But with quantifiers, we must go 
beyond tokens, for in general we cannot identify p|¢ with a token. 
For our present purposes, as long as quantifiers are not included, 
we can take our pointers to be just tokens. The reader may, for the 
sake of convenience, assume this to be the case. The extension to the 
general case is straightforward. 

The upshot of this approach is a new kind of semantics in which 
truth values are assigned to pointers and the usual recursive defini- 
tion of truth is replaced by a set of rules for evaluating networks. 
Here is an informal description of the algorithm, illustrated by a 
simple example. This, I think, should give an adequate idea of the 
method, providing thereby a basis for the discussion that follows. (A 
full mathematical presentation is planned for a separate companion 
paper. A concise version containing the formal definitions and the 
statements, without proofs, of the main theorems can be found in 
my TARK paper.*) 

Consider the above mentioned system of language and pointers. 
Assume that the language (based on predicates and, possibly, func- 
tion symbols) is interpreted, except for two, so-called truth predit- 
cates (or semantic predicates): Trà ) for ‘true’ and Fa( ) for 
‘false’. These two predicates take pointer names as arguments. As- 
suming that p, q, 7, etc., are pointers, and that the pointer names 
used in this discussion are also their names in the formal language, 
we get sentences of the form Tr(p), Fa(q), etc., where p, q, etc., are 
pointers to sentences of this very same language. 


3 “Operational Pointer Semantics: Solution to Self-referential Puzzles I,” Theo- . , 
retical Aspects of Reasoning about Knowledge [hereafter TARK], M. Vardi, ed. 7 


(Los Angeles: Morgan Ka , 1988), pp. 43—59. 


, 
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The goal is an assignment of truth values to all the pointers. It is 
achieved by an algorithm—also called the evaluation procedure— 
which is based on a collection of rules for assigning truth values. 
They are naturally divided into three groups. The first, the standard 
rules group, consists of rules that correspond to the traditional 
rules for assigning standard values; they include also rules for the 
truth predicates. 


Standard rules. Pointers to sentences of the form P(t;,.. , ty), 
where P is a nonsemantic predicate, get their truth values in the usual 
way, via the interpretation of the predicates and the denoting terms. 
Such sentences are called basic. Evidently, different pointers to the 
same basic sentence get the same truth value. It is a standard value 
determined by sentence type alone. And this holds as long as only 
standard rules are employed. 

Now, if p points to a nonbasic sentence, then, in the usual order of 
things, its value is determined by the values of other pointers, which, so 
to speak, are called directly by p. Thus, if p points to the disjunction 
A V B, then the pointers it calls directly are its derived pointers p1 and 
p2 (which, recall, pomt to A and to B). If one of them gets the value T, 
so does p (no matter whether the other has been assigned a value, or 
what that value is). If both get F, p gets F as well. The standard rule for 
negation, i.e., the “toggling” of the standard truth value, should be 
obvious. Other connectives are treated by expressing them in terms of 
V and 7. 

A pointer, p, to a quantified sentence, calls directly each of the 
pointers p|¢ to the substitution instances. The rules for quantified sen- 
tences are obtained by treating them as (possibly infinite) disjunctions 
or conjunctions. For the sake of convenience, I assume that every ob- 
ject in the interpretation’s universe is named by some term and that 
quantifiers are interpreted substitutionally. This is not essential. Quan- 
tifiers can be interpreted referentially, at the cost of adding some obvi- 
ous items.‘ 

Finally, a pointer to Tr(q), or to Fa(q), calls directly q, if q gets a 
standard value, then p gets, in the first case, the same value; in the 
second case, the opposite value. These are the standard rules for the 
truth predicates. 


The essential part of the algorithm, by which it enhances classical 
logic, is constituted by two additional groups: the gap rules deter- 
mine the cases of failure, where GAP is assigned. The jump rules 
determine assignments of standard values, which are based on pre- 


“ Namely, pointers would point to well formed formulas (not only to sentences) 
and truth values would be given to pairs consisting of pointers and assignments of 
objects to variables 
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vious failures. They enable us to use new pointers in order to make 
semantic assertions about previously failed ones. Before I go into 
these two groups, here is a simple illustrative example. 

Let John, Joan, Jack, and Jill make the following statements: 


John: What Jill says is not true and what Joan says is not true. 
Jill: What Jack says is true. 

Joan: What Jill says is not true. 

Jack: Either McX’s conjecture 1s true or what Jill says is false 


For simplicity, we can use here each speaker as a pointer to his 
uttered sentence. The resulting network is represented in figure 1. 
The arrows between pointers (the nodes) do not represent pointing, 
which is a relation between pointers and sentence types, but the 
above mentioned direct calls between pointers. McX’s conjecture, 
represented as a box, is not spelled out. It might involve a whole 
network of its own; but I assume, for the purpose of illustration, 
that it does not refer, directly or indirectly, to any of the four ut- 
terances. 

Actually, the diagram has been simplified by merging certain 
arrows and hiding certain derived pointers; e.g., the arrow from 
John to Jill should be split into an arrow from John to John] (which 
points to the first conjunct of what John says), an arrow from John] 
to John11 (which points to ‘What Jill says is true’), and an arrow 
from John11 to Jill. Note that the pointed-to sentence types can be 
read from the diagram, though they are not explicitly displayed. 


John 
(and) a, 
y NY 
Jill l not true = ian 
| 


4, sack 3 


Vv Pe 
(or) 
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The algorithm is applied recursively. Assume that it results in 
assigning F to McX. This leaves Jack and Jill in a closed unresolved 
loop. Both get at this stage the value GAP. Then Joan, who makes an 
assertion about Jill and does not belong to the loop, gets a standard 
value (here a jump rule is employed). Since her assertion is true, she 
gets T. In the same way, the first conjunct in John’s assertion gets T. 
But the second conjunct (which asserts the nontruth of Joan) gets F, 
hence John gets F. Had McX got T, Jack and Jill would have got T, 
while John and Joan would have got F. 

In the case of the two-line puzzle the network is: 


eng you 


Figure 2 


For ‘r’ read ‘the sentence token on line ‘i’. Right at the beginning we 
get a closed loop consisting of the pointer 1. It gets the value GAP. 
Then 2 gets the value T. 

In the four-speaker example, the standard rules for Tr and for 
negation are used for determining the value of John2 (which points 
to the second conjunct of what John says). Also, the standard rule 
for conjunctions is used for determining the value of John. The 
additional rule groups are as follows. 


Gap rules. The most important of these is the closed loop rule, 

which states the following: if, in the course of applying the evaluation 
procedure, a closed unevaluated loop forms and none of its members 
can be assigned a standard value by any of the rules, then all of its 
members are assigned GAP in a single evaluation step. 
[A closed unsvaluated loop is a set, say L, of unevaluated pointers (i.e., 
which have not been assigned truth values), satisfying the following two 
conditions: (1) unevaluated pointers called directly by any member of L 
also belong to L; (2) from each member of L one can reach any other 
member of L by one, or more, finitely many direct calls; i.e., by a 
sequence pi, - ., Pa, where n > 1, the p,s are in L and each calls 
directly its successor.] 
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In the four-speaker example, Jack and Jill get their value via the 
closed loop rule. And in the two-line example this rule applies to the 
token on line 1, which forms by itself a closed unevaluated loop. In 
both examples the loops are simple cycles. But, in general, the 
closed unevaluated loop can be as complex as you can imagine and 
might consist of any number, finite or infinite, of pointers. 

There is also a simple gap rule that states: a pointer is assigned 
GAP if all pointers called directly by it have been assigned values, 
but the pointer cannot be assigned a standard value by any of 
the rules.” 


Jump rules. Assume that q points either to Tr(p) or to Fa(p), and 
that p, but not q, has already been assigned GAP. Then the jump rules 
(for Tr and for Fa) assign q the value F. 


To see how the jump rules work assume that an unevaluated r 
points to “Tr(p) and that p has been assigned GAP. Then, by the 
jump rule for Tr, rl (which points to Tr(p)) gets F; and, by the 
standard negation rule, r gets T. Thus, the jump rules provide for 
the possibility of successfully asserting that a given GAP pointer is 
not true, or that it is not false. Jump signifies here true ascent in the 
metalinguistic hierarchy. In our example, a jump rule (for 77) is 
needed in order to assign Joan and John] their values. It is also 
invoked in assigning a value to the line-2 token in the two-line 
puzzle. 

To sum up, the assignment of GAP signifies recognized failure: 
we decide that the pointers in question fail to evaluate to a standard 
value. The grounds for such a decision are given in the gap rules. 
The assignment of GAP can serve as a basis for further assignments 


5 Given the standard rules, the effect of this rule is to enforce Kleene’s strong 
three-valued truth tables A pointer p to a disjunction A V B is assigned GAP if 
pl and p2 have been assigned values that do not enable us to assign, by some 
other rule, Aadat value to p. this is the case where ane af pl, po acuned 
GAP and the other is assigned either GAP or F. Pes ive Salalah 
on a different set of standard rules (e.g., pans aap nee eee 
rule does not imply Kleene’s strong truth 

rhe cap rules ena the da ancha ot GAP Maea Eo fad 
ures. But GAP can be assigned also by an additional so-called give-up rule: if at 
some stage we are left with unevaluated Ge ee eee a 
apples, then all of them are assigned ‘AP. Gaps assigned by this rule do not 
constitute recognized failures, but are rather like Kripke's ‘ ‘undefined” u It can 
be shown that in all networks of fimitary type, i.e, where every pointer calls 
(directly or indirectly) finitely many pointers, the give-u rule is never employed. 
The aim of more sophisticated versions of the algori is to avoid, as far as 
possible, the employment of such a rule in nonfinitary networks. 
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of standard values via the jump rules. What is inexpressible in the 
usual denotational semantics is thus expressible through network 
evaluation. 

Various mathematical results, which I shall not go into here, es- 
tablish the desirable properties of the evaluation procedure. I shall 
only describe informally two of the basic results, whose significance, 
I hope, can be seen: 


Order independence. The rules can be applied in any order, to what- 
ever pointers; the (possibly transfinite) sequence of applications results 
in a unique total valuation which does not depend on the order of 
applications.® 

Limited pointer dissent (abbreviated as LPD). Two pointers to the 
same sentence are assigned different values only if one of the pointers 
is assigned GAP. This is also true if the pointed-to sentences are logi- 
cally equivalent. 


The evaluation procedure, also called an “algorithm,” has indeed 
the form of an algorithm: an exact prescription for applying a well- 
defined set of rules. But this “algorithmic” form should not disguise 
its true potential. It applies to all networks, including infinite ones 
of arbitrary cardinality, and it extends both the system of Alfred 
Tarski and that of Saul Kripke.’ Each of these is obtained by deleting 
from my algorithm certain rules: Tarski’s truth definition is what we 
get if we employ only the standard rules for basic sentences, connec- 
tives, and quantifiers. Kripke’s system is what we get if we employ 
only the standard rules—including the standard rules for the truth 
predicates. (In either case, pointers remaining at the end uneval- 
uated are assigned GAP, or “‘undefined.’’) It can be shown that, if 
the jump rules are not employed, then a pointer’s value depends 
only on the sentence it points to; truth values would be thus deter- 
mined by sentence types alone. 

Note that, in Kripke’s construction, both tokens in the two-line 
puzzle constitute gaps (i.e., remain unevaluated), as do the utter- 


° A rule application extends a given partial valuation. If only the standard rules 
are used, then these extensions constitute monotone operators and the order- 
independence claim follows easily by a fixed-point argument ‘With the and 
jump rules, however, the operators are not monotone The claim is nonetheless 
true, but the proof is much more mvolved : 

7 “Outline of a Theory of Truth,” this JOURNAL, LXXI, 19 (November 6, 1975): 
690-716. Repr. in Recent Essays on Truth and the Liar Paradox, Richard 
Martin, ed. (New York: Oxford, 1984). The essays cited in fns. 8, 9, 12, 27, and 
64 are also reprinted in this collection. 
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ances of the four speakers. In the variants of Anil Gupta® and Hans 
Herzberger® they are assigned oscillating values. The two major 
points of difference between my proposal and all the previous ones 
are, first, the use of pointers and, second, the gap and jump rules. 


Recovering the metalanguage hierarchy. There is a precise defini- 
tion, based on the evaluation procedure, which correlates with each 
pointer an ordinal number expressing an intuitive notion of ‘“‘meta- 
linguistic level.” The pointers can therefore be stratified, according 
to their levels, so as to reflect a Tarski-like hierarchy. 

Roughly speaking, a pointer’s level is the number of nested appli- 
cations of jump rules needed in order to evaluate it. In the four- 
speaker example, if a is the level of McX, then also Jack and Jill are 
on level a, whereas John and Joan are on level a + 1. We can also 
refine each of our levels so as to reflect a minor, or local, subhier- 
archy within it. Roughly speaking, the minor subhierarchy reflects 
the height of nested applications of the standard rules for the truth 
predicates, which are necessary—once the major level has been 
reached. 

Unlike Tarski’s hierarchy, we have a single language with a single 
truth predicate and the stratification is effected after the fact, ie., 
after the assignment of truth values. In this respect it is like the 
hierarchy obtained via Kripke’s construction; as he aptly puts it, the 
pointers (in his case—sentences) seek their own levels. But his whole 
hierarchy of grounded sentences corresponds (exactly) to our local 
subhierarchy of level 0. Real climbing starts where Kripke leaves off. 
All our nonzero levels are in his model gaps (“undefined”). 

The construing of truth as a predicate over pointers rather than 
types is related to the metalinguistic hierarchy as follows. Call 
pointers dissenting if they point to the same sentence and have 
different truth values. We know, by LPD, that of any two dissenting 
pointers one is assigned GAP. It turns out that the non-GAP 
pointer is one level higher in the hierarchy: 

Higher level dissent (abbreviated as HLD). For every two dissenting 

pointers, p, and pz, if p, is the GAP pointer then Up) = U(p,) + 1, 

where [(p,) is the level of p,. 


The significance of LPD and HLD will be discussed in section I. 
Before proceeding let me clarify certain points, answering thereby 


8 “Truth and Paradox,” Journal of Philosophical ie xı (1982): 1-60. 
? “Notes on Naive Semantics,” Journal of Phalosop ical Logic, x1 (1982) 
61-102 
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possible questions and allaying, perhaps, certain misgivings that 
might have arisen. Some of these points will be reconsidered, at 
greater detail, in later sections. 


What pointers express. Some might find the idea of assigning truth 
values to pointers strange; the phrase ‘that pointer is true’ may 
sound bizarre. Truth and falsity, so it is argued, have to do with what 
we say. Surely, it is inappropriate to endow some arbitrary objects, 
used for pointing to sentences, with truth or falsity. To argue thus is 
to misunderstand my proposal. The phrasing is not essential. It is 
not strange to speak of the truth or falsity of sentences, meaning 
thereby sentence tokens; and if it comes to pointers, one could 
speak of a pointer pointing to a truth, or to a falsity, or indicating, 
or failing to indicate one. Nothing marks a pointer as such except its 
function within a global arrangement. But this very function makes 
it a tool for expression, hence—something on which truth values 
can be hanged. 

As far as the ontology of statements, propositions, meanings, etc., 
is concerned, my proposal is neutral. A nominalist who bars inten- 
sional entities can use something like the evaluation procedure as 
part of the account of the functioning of language. But the proposal 
is compatible with seeing pointers (in the contexts of networks) as 
means of expressing propositions, or statements.!° On a pretheo- 
retic level, it is not difficult to make sense of “the proposition ex- 
pressed,” or “the statement made” by means of a pointer. For exam- 
ple, what is expressed by the sentence token on line 2 is simply that 
the sentence token on line 1 is not true, which is to say: is not 
assigned the value T by the evaluation procedure. Although the 
token on line 1 is of the same sentence type, it fails, because of the 
loop, to express this. Similarly, in the Jack and Jill exchange, Jack 
says of himself that he does not express a proposition, but in this 
very Saying no proposition is expressed (at least not the proposition 
indicated by his words, at their face value.) What Jack seems—but 
has failed—to express is successfully expressed by Jill. If only sen- 
tence types are considered, Jack and Jill appear to have stated the 
same thing. Actually they did not. The source of the illusion is that 
they employed sentence types that are, in the given conversational 
context, logically equivalent. 

The following can perhaps add clarification. Two intuitive theses 
about truth lead to the strong liar: 


1¢ Cf fn. 55 concerning ‘propositions’ and ‘statements’. 
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(T1) If the sentence ‘x is not true’ is true, then x is not true. 


(T2) Lf x is not true, then the sentence ‘x is not true’ is true. 


Here ‘x’ ranges over sentence types and quotes are interpreted as 
Quine’s semiquotes (e., the quoted sentence, which is x’s value, is 
to replace ‘x’). We get the strong liar when x is, or is equivalent to, 
‘x is not true’. But if we construe ‘true’ as a predicate of pointers, 
then ‘x’ should range over pointers and (T1) and (T2) should be 
modified by replacing ‘the sentence’ by ‘any pointer to the sen- 
tence’. We can still maintain (T1) in its new form, but we have to 


qualify (T2) by rewriting it as: 


If x is not true then any unfailed pointer to ‘x is not true’ is 
y po 
true. 


The paradox is thus blocked by allowing pointers—in particular, 
tokens—to fail, and by giving up (T2) for failed tokens of ‘x is not 
true’. If we employ the analogous strategy for the case of types, and 
x is a liar sentence, then it is the type ‘x is not true’ which fails. Thus, 
we cannot use it to say that x is not true. Further observations 
concerning pointers and propositions are made in section IV. 


An essentially non-Tarskian semantics. In Tarskian semantics, 
truth is a property of sentence types; tokens as such are not consid- 
ered. That tokens play a central role in natural language, due to the 
abundance of indexicals and demonstratives (words like ‘I’, ‘you’, 
‘now’, ‘this’, ‘that’) is, of course, very old news. These terms, having 
highly variable context-dependent denotations, cause truth value to 
vary according to token. 

But it would seem that the Tarskian framework can be adjusted, 
or rather supplemented, so as to handle these phenomena. Having 
replaced the items of varying denotation by others, whose denota- 
tions—within the considered discourse—are fixed, we can apply 
Tarskian semantics. To use an admittedly simple example, trans- 
form ‘He is tall’ to ‘X is tall’, where ‘X’ is an appropriate context- 
insensitive name or description of the indicated person, and evalu- 
ate the result as a sentence type. If needed, a further reduction can 
be used to eliminate the temporal indexicality of ‘is’. The truth pre- 
dicate that is defined over tokens is thus reduced to a predicate 
defined a la Tarski over types. 

The present situation is altogether different. The predicate de- 
fined by the evaluation procedure is irreducibly over tokens, or— 
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more generally—over pointers.'' Here, as I shall show, differences 
in truth value are not traceable to terms that behave like indericals, 
or have ambiguous denotations, which occur in the sentences. 

Attempts, like Tyler Burge’s,’ to resolve the semantic paradoxes 
by construing ‘true’ as an indexical cannot but fail to achieve that 
goal. Indeed, as the analysis will show, Burge’s account puts all the 
burden of work on a rather obscure mechanism of implicatures, 
leaving indexicality idle. (The general argument concerning indexi- 
cality is laid out in section III; the more specific discussion of 
Burge’s work is in section V.) 

Much more significant than the failure of indexicality is the 
change from compositional semantics, whereby the meaning of a 
sentence is “composed” of the meaning of its parts, to a radically 
different one, in which tokens, or pointers, are evaluated globally. 
The present proposal shows that this essentially different picture 
can be embodied in a formalism as precise as Tarski’s truth defini- 
tion for formal languages. 

An argument propounded by Donald Davidson has it that the 
ability of fmite beings—such as humans—to use a potentially infi- 
nite number of sentences is to be explained by assuming a finite 
number of rules,’* whereby meanings of compound expressions are 
determined by the meanings of their components. A setup of this 
nature resembles Tarski’s truth definition. That something of this 
kind underlies a great deal of the functioning of language cannot be 
doubted. But it is equally true that a great deal lies beyond it. If 
human ability for controlling a potential infinity is to be modeled by 
a formal device—a plausible working assumption, though far from 
an axiom—then the appropriate concept is that of an algorithm. A 
finitely axiomatized compositional semantics amounts to a rather 
special kind of algorithm. Appealing as the model is, it is more than 
plausible that the processing of language, on the level of meaning, 
involves also algorithms of quite a different nature. I shall say more 
on this score in section ITI. 


1! Formally, one can of course redefine “type” by stipulating that tokens that 
get different truth values (e.g, those on lines 1 and 2) are to be considered of 
different . Isaac Levi observed that such a stipulation might not be bizarre, 
given that token-type distinction is not always clear-cut and that homonyms 
might be considered as different Bd because of their different meanings. But if 
we do this, the very definition of “type” presupposes the assi t of truth 
values to tokens; ın this sense, the predicate 1s still i ly over tokens. 

12 “Semantical Paradox,” this JOURNAL, LXXVI, 4 (Aprl 1979): 169-98. 

13 On the syntactical level, Noam Chomsky had argued in this way for an innate 
language-processing mechanism 
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Truth, falsity, and GAP. In the pointer formalism, the predicates 
representing truth and falsity figure as two primitives. The evalua- 
tion rules relate truth and falsity, through negation, as follows. A 
pointer to the negation TA gets the value T, or the value F, iff the 
derived pointer to A gets the opposite value. Thus, our concept of 
falsity accords with the well-known thesis, according to which a sen- 
tence is false iff its negation is true. It is possible to set up an 
equivalent formalism that uses only Tr, or only Fa, as a primitive.’ 

Let ‘false,’ denote falsity in the sense employed here. Another way 
of relating falsity to truth is given by ‘false,’. A sentence is false, just 
when it is not true.” Consequently, every sentence is either true or 
false,. When one reads ‘false’ as ‘false,’ there is a miraculous disap- 
pearance of truth-value gaps. Actually, they are there, thinly dis- 
guised and easy to uncover. A sentence is a gap if and only if both it 
and its negation are false,. 

As the last equivalence suggests, easy translations are available 
from one ‘false, terminology to the other, making the two equiva- 
lent for purposes of expression. The present formalism can be re- 
cast in terms of ‘false,’, at some cost of additional structure.'® 

The false, interpretation might remind one of cases, arising in 
Bertrand Russell’s theory of descriptions, where sentences and their 
“negations” come out false; e.g., (i) ‘The present king of France is 
bald’ and (ii) ‘The present king of France is not bald’.’7 But the 
similarity is only apparent. The falsity of (ii) is caused by the narrow 


4 We can define the evaluation procedure for a subsystem based on a single 
truth predicate. But to have the expressive expres Foire QO Pee reen! em, we need 
a way to express both Tr(p) and Falp), ¢.g., to say that p 
is a gap (Tp) A —Fa(p)). This requires an additional rica item for 
pointers: a negation operant that associates with every pointer p a pointer, say 
neg(p), to the ion of the sentence pointed to by p. We can then translate 
Fa(p into Trinegt?)) (We should stipulate that ne ve and that if p points to 
a negation then neg(pl) = p). Note that, in neg(p) will not be a token, 
even if p is. The present formalism avoids the need for a negation operant; hence, 
unless we consider quantifiers, we can assume that all our pointers are just tokens. 

15 Such a reading of ‘false’, fy He ate Ce deale ee T 
gord bean ert “What 1s Russell’s Theory of Descriptions?” Physics, 

c, and History, W. Yourgrau and A. D. eds. (New York: Plenum, 
1970). ee ee p in Jon Barwise and John Etchemendy’s 
The Liar: An Essay tn Truth and Circulanty (New York: Oxford, 1987). But 
the issues there, to be discussed in section IV, are complicated by the treatment of 


negation 

i$ We shall need a negation operant for pointers; cf. fn. 14. 

17 I am ignoring here the more radical Russellian position according to which 
definite descriptions are mcomplete symbols to be eliminated, in the final ac- 
count, altogether. 
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scope of the negation. True negation—one that applies to the whole 
sentence—does convert falsity to truth: (ii) ‘It is not the case that 
the present king of France is bald’ comes out true. Much of the 
appeal of Russell’s theory is due to its ability to distinguish between 
local negation, as exemplified in (ii), and true negation, as exem- 
plified in (iii). There are no gaps here. Acceptable or not, Rus- 
sell’s theory rests squarely within the framework of classical two- 
valued logic. 

The present situation is altogether different. The nontruth of the 
line-1 sentence (in the two-line puzzle) does not hinge on the re- 
stricted scope of ‘not’. Making that scope as wide and as unam- 
biguous as possible does not make the slightest difference. Consider 
for example: 


line 1: It ts not the case that the sentence on line I ts true. 


To classify this last sentence as false means to classify as false both a 
sentence x and ‘it is not the case that x’.’® And while it is conceivable 
for both x and ‘it is not the case that x’ to fail, it is inconceivable that 
both be false—unless ‘false’ is just another mark for failing. That is, 
‘is false’ is to be read here as is nontrue, or—astill better—fatls to be 
true, or fails to express a truth. To what extent this subsumption of 
failure under falsity reflects ordinary usage is a question for the 
lexicographer. 

A two-fold classification of all sentences into “true” and ‘“‘false” 
may serve to make a philosophical point.’® But it should be clear that 
by no means does it signify a two-valued logic. There is more to 
two-valued logic than there being just two ‘‘truth values.” The two 
values should underly the semantics of the sentential connectives. 
That they should work well with negation is the minimum re- 
quirement. 

To sum up, the construing of ‘false’ as ‘false,’ has no bearing on 
the existence of gaps, but is merely a terminological grouping of 
falsity G.e., truth of the negation) together with other kinds of fail- 
ures. There is a moral in this concerning propositions. Either we say 
that liar-type sentences express no propositions, or we make place 
for propositions that constitute truth-value gaps. 


The liar and the truth teller. The algorithm described so far treats 
the truth teller (a sentence asserting its own truth, e.g., “What I am 


18 Take as x the sentence ‘the sentence on line 1 is true’. If the sentence on line 
1 is false, so is x and so is ‘it is not the case that x’ 

19 As was done, for example, by Michae] Dummett, in “Truth,” Proceedings of 
the Aristotelian Society, LIX (1959): 141-62. 
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saying now is true’) in the same way as the liar; both get GAP. Our 
framework does, however, provide means for distinguishing be- 
tween the two. The distinction is made by considering consistent 
assignment of truth values, or what I call in the technical paper 
self-supporting valuations.” There is a total self-supporting valua- 
tion that assigns the truth teller (token) the value T, and there is one 
that assigns it F. But every self-supporting total valuation must as- 
sign the liar (token) the value GAP. 


Different variants of the algorithm. The present variant of the 
algorithm leads to a truth-value assignment that conforms to 
Kleene’s three-valued truth tables (cf. fn. 5). Some may, however, 
prefer to assign T to all pointers to tautologies. This can be achieved 
by using a variant in which the set of standard rules is replaced by 
another set, based on supervaluatzons. In the usual supervaluation 
method—which applies to sentence types—a sentence is given a 
standard value whenever it gets this value in all possible complete 
standard extensions of the partial valuation. This idea can be 
adopted to assignments of values to pointers; I defer details to the 
technical paper. 

It is noteworthy that, to get the supervaluation variant, only the 
standard rules are changed. The gap and jump rules remain the 
same.® In other variants as well, changes can be done “locally” 
within this or that group of rules. This illustrates the modular char- 
acter of these groups. 

More sophisticated variants, not to be presented here, incorpo- 
rate additional gap rules. These rules introduce grounds for discrim- 
inating between pointers. They enable early assignments of GAP to 
some of the pointers, thereby salvaging the remaining ones that 
would have otherwise been all members of the same closed loop. 
The total effect is to reduce the assignment of GAP and to make 
more pointers available as means of expression. Such variants aim at 
achieving higher degrees of self-contained systems for some highly 
expressive languages—a goal that will become clearer in the next 


section. 
II BLACK HOLES 


Suppose that we assign truth values to sentence types. Consider any 
notion of sentence equivalence, which preserves truth value and 


= Roughly speaking, a self-supporting total valuation 1s one that is closed under 
rule applications: applying to it the rules of the algorithm yields the same valua- 
tion. 

"1 Changing the standard rules can affect the applicability of the gap rules, 
because a gap rule can apply only if other rules cannot. The gap rules themselves, 
however, need no change 
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which satisfies the following two mild conditions. (i) Each sentence is 
equivalent to itself. (ii) If ‘...’ i 
equivalent sentences, then ‘.. . is not true’ is equivalent to ‘—— is 
not true’. Let us now extend the original two-line text as follows: 





line 1: The sentence on line 1 1s not true. 
line 2: The sentence on line 1 is not true. 
line 3: The sentence on line 2 is not true. 


line 4: The sentence on line 3 is not true. 


line n + 1: The sentence on line n is not true. 


The sentences on the first two lines are the same, hence—equiva- 
lent. And, by (ii), the equivalence of the sentences on lines k — 1 and 

` k implies that of the sentences on lines k and k + 1. Consequently, 
all the sentences are equivalent. Since the sentence on line 1 is not 
true, the sentence on line 3 is not true either, but what I have just 
Stated is not true, because it is the sentence on line 4 (or an obvi- 
ously equivalent reformulation of it), and also this last statement of 
mine is not true, because it is the sentence on line 5, etc. None of 
these sentences can be successfully asserted, because none of them 
is true; but again I find myself slipping into nontruth: what I have 
just said is not true for it obviously includes the conjunction of these 
very same sentences; and also thts last assertion is not true, and so 
on ad infinitum. 

The innocent looking gap has developed into a black hole that 
sucks into itself every sentence asserting that some sentence in the 
hole is not true.” Formally, a hole hierarchy is definable as follows: 
S is a O0-hole if it is a gap and, by induction: S is an n + 1-hole if it is 
an n-hole and ‘S is true’ and ‘S is false’ are gaps. Then S is a black 
hole if it is an n-hole for all n. (Here and elsewhere, quotes are to be 
read as semiquotes, whenever this is obvious from the text.) We shall 


2 That some such minimal equivalence notion should obtain 1s inherent in the 

very assignipent of truth values to sentence types The giving up of Qi) would 
saoi dat something olier han sentence sped raed in determining truth 
value. 

B So far we assumed an equivalence the above mentioned conditions, 
and the following rules that i that the line-1 sentence is a gap. (1) If ‘S is true’ 
1s true then S 1s true. (2) If ‘S is not true’ is true then S is not true. (3) If the 
negation of a sentence is false then the sentence is true. We shall also assume, in 
the rest of the discussion, that if the negation of a sentence is true then the 
sentence is false and the equivalence of ‘S is false’ and ‘not-S is true’. 
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use ‘hole’, unqualified, to denote a 1-hole. Thus, we cannot assert of 
a hole that it is not true, or not false, or a gap (neither true nor 
false), without this second assertion being itself a gap. 

Of course, syntactical properties can be asserted of any sentence. 
But no information about the truth value of a hole can be conveyed 
directly. And as for black holes, they are semantic untouchables. No 
semantic information about them can be conveyed in any way, be it 
as indirect and across as many layers as one may wish. 

Actually, the arguments produced so far in the case of our exam- 
ple imply only that the sentence on line 1 is a “‘semiblack hole’; we 
have only iterated ‘. . . is not true’ but not ‘. . . is not false’ .** This is, 
however, bad enough. And it becomes a fully fledged black hole, 
under some additional very natural assumptions. In particular, it is 
easily seen that every gap is a black hole if we assume that S and ‘S is 
true’ are equivalent, under an equivalence relation satisfying Leib- 
niz’s substitutivity law.” (Note that this is not the Tarski bicondi- 
tional: ‘S’ is true + S, but only the requirement that ‘S is true’ and S 
should get.the same truth value, including GAP.) It is difficult to see 
how this equivalence can be avoided if truth values depend only on 
sentence types. Indeed, it holds in the system of Kripke (op. cti.; 
discovered independently around 1974 by Walther Kindt”), in its 
Martin-Woodruff”’ forerunner and in the variants of Gupta (op. 
cit.) and of Herzberger (op. cit.). Every truth-value gap is a black 
hole in these systems. 

When truth values depend only on types, black holes are thus 
bound to occur on the most elementary levels of language. (Think 
how little is needed to produce sentences like the sentence on line 
1.) Within such a system the little game I have just played, of repeat- 
edly refuting myself by trying to assert the “‘unassertable,”’ signifies 
a genuine metaphysical mystery. And yet the trick is quite transpar- 
ent and we, Jack and Jill included, have no trouble at all asserting 


™ Our minimal assumptons (cf. tbid.) do not imply that ‘The sentence on line 1 
is false’ is a gap. This can be shown by constructing a model—to be sure, a highly 
pathological one—satisfying the minimal assumptions, in which this last sentence 
is false 


™ That is to say, the equivalence of S, and S, entails that of ‘S, is true’ and ‘S is 
true’ and sumilartly for ‘ *. Actually, for the sentence on line n to be a black 
hole a weaker assumption than the i ce of S and ‘S is true’ will suffice. 


E.g., for n = 2 the assumption is that ‘S is true’ 1s equivalent to ‘S is true’ 1s true’. 
For larger n one iterates ‘is true’ accordingly, the assumption becomes weaker 
with each iteration 

T$ “Introduction of the Truth Predicates into First-order ,” m For- 
mal Semantics and Pragmatics for Natural Languages, Guenthner and 
Schmidt, eds (Cambridge. Reidel, 1979). 

T “On Representing “True-in-L’ in L,” Philosophia, v (1975): 213-7. 
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this sort of “unassertable” by employing another token of the very 
same troublesome sentence. Language usage involves a mechanism 
for solving the black-hole dilemma. And if our goal is to model 
natural language, or a basic aspect of it, then the elimination of 
black holes—at least those of the simpler varieties—should be an 
overriding concern. Failure in this respect means that we have failed 
to capture something very essential to the functioning of language. 

Although Kripke notices the failure, he plays it down, not giving 
the issue the central place it deserves.” “The sense [so he writes] in 
which we can say, in a natural language, that a liar sentence is not 
true must be thought of as associated with a later stage in the devel- 
opment of natural language, one in which the speakers reflect on 
the generation process leading to the minimal fixed point.” 

Indeed, the nontruth of the sentence on line 1 hinges on a sort of 
metareasoning. But this does not place the assertion on line 2 at a 
“later stage in the development of natural language.” Certainly not 
at a stage that comes after the construction of the minimal fixed 
point, an inductive process that moves through all recursive ordin- 
als! The fact that, after carrying out this transfinite construction, we 
are still left with the elementary problem of making sense of what 
Jack and Jill said shows that, far from having completed a home run, 
we are still at first base. 

The basic (and simplest) version of my proposal eliminates the 
holes in all situations of finitary type—those in which every pointer 
calls only finitely many pointers.” To this type of network, which I 
call locally finite, belong all the tangled loops that can arise when 
finitely many people make assertions about each other, provided 
that altogether finitely many assertions are involved. In particular, it 
contains the examples discussed in the literature. The absence of 
holes is formally expressed by the following result, which holds for 
locally finite networks under quite general conditions:*° 


If p has the value GAP, then there is a pointer q to the sentence 77r(p) 
A “Fa(p) (i.e., the sentence asserting that p is a gap), such that q has 
the value T. 


We can thus successfully assert, of any gap, that it is a gap. An- 
other result, which applies under the same conditions, states that 


int has been nicely put by Burge (op. ctt.). 

ao inter q if one can reach q from p by one or more, finitely 
nny ete ee “Preview” where the rithm is described. 
™ The required condition is that there are infinitely many independent inters 
to each sentence type, where “i t pointers” are those which have no 
common subpointers, a subpointer being the natural generalization of “‘subto- 
ken.” 
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every sentence has a pointer to it that has a standard (T or F) value. 
(Recall that, by LPD, pointers to the same sentence, which get stan- 
dard values, always agree in value.) 

In certain situations of infimitary type, when the language is suffi- 
ciently expressive, holes, even black holes, appear. Of these, some 
are eliminable by more sophisticated versions of the algorithm and it 
appears that further improvements are possible. It is far from clear 
how far one can go toward the elimination of holes. In the context 
of mathematical languages, the elimination of holes will bring us 
near to what one might consider a “universal language’’—a self- 
contained system, one that includes its own semantics. For this rea- 
son alone it is highly plausible that rich languages must have holes. 
Note, however, that we have no formal proof of this. The standard 
argument from the liar is no longer valid for setups of the kind 
proposed here. But perhaps the question of proof should not even 
be raised at the present stage, since it is not clear what the present 
proposal might, in further developments, encompass. 

In the broader perspectives of natural language “‘hole-like’’ phe- 
nomena are bound to occur. Some discourses take us to the edge of 
meaning, whereby we are tempted to express the inexpressible or to 
think the unthinkable. Holes are perhaps the inevitable price for a 
powerful language capable of evolving. Indeed, some situations can 
be described only by saying that we have changed the language, or 
that we have altered the very fabric of our conceptual framework. 
But I do not think that the Jack and Jill exchange calls for such a 
description. Here we can at least tidy up the more elementary levels 
of language. 

A comparison with set theory comes naturally to mind. The anal- 
ogy between the semantic paradoxes and those of set theory is well- 
known. Russell grouped them together and proposed the same kind 
of treatment.*! His solution is based on a rigid predetermined classi- 
fication of sets, which makes room for quantifying over sets of the 
same type, but not over sets in general. The hierarchy is incorpo- 
rated into language itself. Axiomatic set theory, say ZF, has disposed 
of the built-in hierarchy, replacing the aggregate of concepts set-of- 
type-i by a single concept of set. 

Russell's hierarchy of sets parallels his hierarchy of propositions, 
of which Tarski’s hierarchy of metalanguages is a close relative. The 


5! More recently, the analogy has been emphamzed by Parsons Both types of 
paradoxes have been by me as cases of what I called an infinity paradox: 
a contradiction that results from an attempt to construct self-containing systems. 
Cf. my ‘Paradoxes of Infinity and Self-application,” Erkennints, xx (1983): 131- . | 
55. f a 


244 THE JOURNAL OF PHILOSOPHY 


representation of Tarski’s hierarchy in a formal system yields an 
aggregate of truth predicates true,, where i ranges over some list 
fixed in advance, but no single true. 

In mathematical setups the problem is not acute. First, because 
‘true’ is used there in a restricted way, which bars self-reference. 
Second, because, working inside set theory, we can assign truth val- 
ues to mathematical statements—as long as these statements are 
about structures that are sets. A closer look at the second point 
shows that the metalinguistic hierarchy has been subsumed under 
the hierarchy of sets (the one obtained by iterating the power-set 
operation). Problems resurge when the structures in question are no 
longer sets; in particular, when truth in ‘“‘Cantor’s universe” is con- 
sidered. In this respect, axiomatic set theory has not solved the 
problems arising out of the set-theoretic paradoxes. It has only 
shifted them to the theory’s fringes, where they are of no concern, 
except to set theoreticians who investigate the theory’s borders (for 
a detailed discussion, cf. ibid.). 

This perspective marks a clear goal for the semantics of ‘true’. A 
semantics of formal languages—containing their own truth predi- 
cate—should do for the concept of truth what set theory, such as 
ZF, does for the concept of set. By minimizing the black-hole phe- 
nomena we are shifting the problematic points, where the system’s 
inadequacies show, to the less accessible parts of the system. 

Whereas set theory is informed by mathematical thought, pointer 
semantics draws directly on insights derived from natural language. 
(Mathematics itself, however, is an outgrowth of thought manifested 
in ordinary language.) Yet it results in systems that are as formally 
precise as any piece of mathematics. The present work can be seen 
as a first step that eliminates black holes at the basic linguistic level. 
As mentioned earlier, black holes reappear, in highly expressive lan- 
guages. These are cases in which the pointing relation itself is de- 
scribable in the language and Gédel’s techniques are available. At- 
tempts to eliminate such black holes lead to more sophisticated ver- 
sions of the algorithm and to some intricate, interesting questions. 
Thus, the formalization of mechanisms inherent in linguistic prac- 
tice also gives rise to foundational questions of logic. 

IU. TOKENS, TYPES, AND COMPOSITIONAL SEMANTICS 
Let me start with a commonplace observation. When a child learns a 
language he encounters first of all tokens of linguistic entities. The 
child acquires the language by learning the effects of token display 
and by participating in the token-display exchange. This observation 
does not prove that tokens are prior (in some nontrivial sense) to 
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types; for the ability to learn the language is preconditioned by the 
ability to recognize token similarity and, thereby, to classify tokens 
according to type. The observation is made in order to underline the 
obvious fact that there are many ways of passing information by 
means of an interactive game-like process. A useful description of 
the game must, inevitably, ignore through abstraction various 
aspects; otherwise, we shall be submerged under sheer detail.” Of 
course, we may ignore only inasmuch as it does not restrict us seri- 
ously in accounting for what is going on. There are thus two con- 
flicting requirements: that the account we give be sufficiently sim- 
ple, and that it be adequate for handling a broad enough class of 
applications. Usually there is trade-off between the two. 

The fixing on sentence types, as on the basic truth-carrier units, is 
a natural move. It leads to an enormous simplification, and it clears 
the way for compositional semantics: a semantics whereby linguistic 
entities are analyzed as complexes whose meanings derive in a sys- 
tematic way from that of their components, via certain—relatively 
few—recursive rules. The truth value of a sentence is thus deter- 
mined recursively; the context in which the sentence occurs can be 
completely ignored. 

A description of this kind fits accurately formal languages con- 
structed by logicians; which is no wonder, since it is the very princi- 
ple on which such languages are constructed. It does not fit natural 
languages, where context dependencies abound, due to indexicals, 
demonstratives, and ambiguous terms. Nonetheless, it reflects a very 
essential aspect of natural language. For if the dependency of truth 
value on context can be traced to the shifting denotations of certain 
context-sensitive terms, then once these denotations have been de- 
termined—how, is another matter—we can derive truth values by 
means of the above mentioned recursive rules.” 

Call two tokens dissenting if they are tokens of the same sentence 
type, but are accorded different truth values. Then the account just 
outlined amounts to the following. If p and q are dissenting tokens, 
then the sentence type should contain terms, say ti, tg, ..-, ta, 
whose references differ on the two occasions. If we are to replace 
each t, by a suitable ¢,, which denotes on both occasions what t, 
denotes in the context of p, the resulting tokens, say f’ and q', would 


* On a more fundamental level, the very concept of description means that 
many details have been ignored 
3 Such a view underlies Davidson's semantic program 
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have the same truth value, namely, that of p. We can call such an 
elimination of context dependency local disambiguation.™* 

For example, when ‘I am tall’ is uttered by two speakers, only one 
of whom is tall, the conflict in truth value is traceable to the differ- 
ent denotations of ‘I’. The dissent can be resolved by using proper 
names. If needed, a name can be introduced by stipulation: “Let S1 
be the first speaker.” In the counterfactual situation, where the two 
speakers use ‘Sl’ instead of ‘I’, the two utterances have the same 
truth value: that of the first speaker’s original utterance.” Or con- 
sider two tokens of ‘tanks are never made of wood’, one in a treatise 
on weapons, the other in an essay on storing methods. The differ- 
ence in truth value is due to ‘tanks’, and is eliminable by substituting 
the term by ‘armored vehicles on caterpillar treads’.* 

The picture is radically different when every term contained in 
the sentence type has the same denotation, on both occurrences in 
two dissenting tokens. Here, a truth-value difference cannot be re- 
duced to denotational shifts of sentence parts. It is accounted only 
by the token’s role as a unit within some global arrangement. And 
such, I hope to show, are the cases deriving from the self-referential 
usage of ‘true’. The argument for the failure of indexicality shows, 
in fact, the failure of the usual compositional semantics.°” 

First, I take it that in the two-line puzzle, and in similar cases, we 
do have token dissent. We do want to say that the sentence on line 1 
is not true but the one on line 2 is; we do side with Jill against Jack. 


H The “context insensitivity’ of the terms t; is not necessarily absolute. It only 
means that they have the same denotations throughout some wider context, which 
includes that of p and that of q. 

The terms t, can be either noun phrases or predicates. I assume here, for the 
sake of simplicity, extensional contexts But this is not essential. Local disambig- 
Sey es at previ ave ay ne eee 
equivalence notion The term ż, should be intensionalty equivalent to trin 
context-of-p. 

® Instead of changing, counterfactually, the original utterances, we can let the 
two speakers reutter the sentence, with ‘I’ replaced by ‘S1’, the natural assump- 
tion being that, in the given circumstance, an utterance and its repetition by the 
same speaker are equivalent This strategy does not work if the indenical is time- 
sensitive, unless the repetiuon is sufficiently close so as to make the ume differ- 
ence msignificant 

** In lumping together indexicals with cases of ambiguity, I am not glossing 
over what distinguishes the first The “ambiguity” of ‘I’ means that its denotation 
shifts according to context. Because the shift is systematic, ‘I’ can be regarded, in 
one sense of ‘meaning’, as having a fixed meanmg: it always denotes the speaker. 
Where no such systematic account 1 available, we must rest content with what 
informal p tics will 

Y The ‘usual’ is added here, in order not to rule out other types of 
ana, For example, one may consider composition of networks. 
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Or, to use Buridan’s example,” if Plato says at time t: “What Aristo- 
tle says at time ¢ is not true,” and Aristotle says, at the same time, the 
same about Plato, then both sayings are not true; but if a third party 
were to assert at that time that what Plato (or what Aristotle) says is 
not true, Ats saying would be true. 

Evidently, ‘The sentence on line 1’ in the two-line puzzle refers in 
both tokens to the same object, be this sentence type or sentence 
token. The only candidate whose meaning, or denotation, might 
have changed is ‘true’. But note that the argument for the nontruth 
of the line-1 sentence employs ‘true’ in the very same sense of the 
‘true’ occurring on line 1.® The argument simply shows that the 
sentence on line 1 does not fall under what ‘true’ tn this very sen- 
tence token denotes. And if this is what we write on line 2, then the 
‘true’ on the second line has the same denotation as that on the first. 
(Indeed, if the ‘true’ on line 2 is construed as being on a higher 
level, then the sentence on line 2 is false!) The difference in truth 
value is not caused by some shift involving the ‘true’ that occurs in 
the sentence, because no shift takes place. 

Interestingly, the analysis just offered accords with that given by 
Burge in his discussion of a similar puzzle (op. cit.). Phrased in the 
present terms, his conclusion is that the tokens of ‘true’ on the two 
lines should be construed as standing for the same truth concept. 
The difference in truth values is traced by him to different implica- 
tures involved in evaluating the sentence. (As will be shown in sec- 
tion V, some unspecified implicature mechanism is implicitly pre- 
supposed by Burge throughout, while his indexicality account does 
no actual work.) 

Although I find the foregoing argument sufficient for my pur- 
pose, I shall supplement it by another, which is, I think, of indepen- 
dent interest. Again, I consider the two-line puzzle. I shall not ap- 
peal to how we derive the conclusion about the line-1 sentence, 
however, but proceed in a more general way. Let the ‘true’ on line 1 
be changed to a nonindexical ‘true*’ that denotes once and for all 
what ‘true’ in the original context denotes.*° We thus freeze ‘true’ in 
its line-1 context. The freezing, one feels, should not affect anything 


* Quoted in Burge, op. cit. 

* The argument rests on (T1). If ‘x is not true’ is true, than x is not true. (And 
in this phrasing the various occurrences of ‘true’ denote the same truth concept, 
whatever that may be.) The familar argument now runs thus. If the line-1 sen- 
tence falls under what ‘true’ in the line-] context denotes, then by (T1) that 
sentence does not fall under what ‘true’ in the line-1 context denotes. 

* Either the change 1s effected by erasing and rewntng, or we imagine a coun- 
terfactual situation in which the second version is written instead of the first. 
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that has to do with truth. Saying, while addressing Jack, ‘you are tall’ 
and saying ‘Jack is tall’ come to the same. 

There is yet a subtle complication: By changing ‘true’ to ‘true*’ we 
have altered the reference of ‘the sentence on line 1’. There is thus 
the additional change in what that noun phrase denotes. Nonethe- 
less, as far as truth or nontruth is concerned, this added difference 
should not matter. It might have, had the sentence said something 
about lexical properties. Changing 


You are tall and this last utterance of mine is short. 
said by Jill to Jack, to 


Jack Horatio MacWiltington the Third is tall and this last utterance of 

mine is short. 
makes a difference. But with ‘true’ instead of ‘short’, the two come 
to the same.*? 

Since the original line-1 sentence is not true, its frozen version is 
not true as well. And since we can assume that truth* satisfies (T1) of 
the “Preview,” this being a minimal requirement of any truth con- 
cept, it obviously follows (cf. fn. 39) that the frozen version is not 
true*. Having written this last conclusion on line 2, we get another 
token of the frozen version, this time a true one. We have eliminated 
the indexical context dependency, but we get back two dissenting 
tokens. Local disambiguation fails. 

Thus, indexicality, or anything like it, cannot account for truth- 
value changes accruing through the self-referential usage of ‘true’. 
This is not to say that the picture of truth as involving meaning 
variations is altogether wrong. Something like truth-level shift does 
take place. And it is important to disentangle this true aspect from 
the wrong idea that ‘true’ is indexical. 

We can, in fact, view the judging of the sentence on line 2 as 
taking place on a higher level than the judging of the sentence on 
line 1. But the shift to higher level is determined directly by the 
place of the token in the network, not by the mediation of context- 


‘1 The analogy uw not altogether complete. With the changes in two separate 
conjuncts: ‘you are tall’ and ‘this last utterance of mine is true’, we can argue 
idee Ae AS eg ecb ii e T NGO a 
when the changes are in the same atomic sentence. But this cannot provide 
A ead T Ranta a true’ to ‘true*’ as a cause for truth-value 
change. Whoever has misgivings on this score may consider a variant of the liar in 
which the sentence is, so to speak, split: on line 0 we have, ‘The sentence on line 1 
is true’, and on lme 1, The sentence on line 0 1s not true’. Can the freezing of 
‘true’ on line 1 make a difference? I cannot see that it should matter for the 
sentence on line 0 and, by consequence, for the sentence on line 1 either And 
neither should an additional freezing of ‘true’ on line 0 
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sensitive terms in the sentence, because all the terms retain their 
original meanings. We have seen that the evaluation procedure pro- 
vides a formal reconstruction of the idea of levels. The line-2 token 
is seen to be one level higher, because its evaluation takes one appli- 
cation of a jump rule, while no jump rule is needed for evaluating 
the first. If one wishes, one can get, as a corollary of the level assign- 
ment to pointers, a level assignment to ‘true’. The prescription is 
simple. Assign to the occurrence of ‘true’ in a token of ‘true(p)’ the 
ordinal number assigned to p; for this is the level that the evaluation 
of p reaches. In particular, the ‘true’ on both lines is assigned level 
0—the number assigned to the line-1 token. But were we to write on 
a third line ‘the sentence on line 2 is true’ (or ‘not true’), that ‘true’ 
would get level 1. 

Occasionally, truth levels are indicated by attaching subscripts to 
‘true’. This would be harmless, provided that we are not misled to 
see in these subscripts, or indices (as they are also called), some form 
of indexicality. The indices play no role in determining truth value. 
They come after the fact, as an annotation to a finished process, an 
underlining of certain aspects of the evaluation. 

While the reference of ‘I’ is picked through the indexical mecha- 
nism from the multitude of humans, nothing of the kind obtains in 
the case of ‘true’. There is no preexisting multitude, or hierarchy, of 
truth concepts, out of which we pick the concept we invoke. The 
hierarchy is created by the evaluation process itself, by the very act 
of judging, by the linguistic exchange. 

The functioning of ‘true’ exemplifies that aspect of linguistic activ- 
ity whereby sentence tokens function as the units of the exchange. 
The meaning of a token is not reducible to what the sentence says, 
even when the denotations of its terms have been fixed. It derives 
also from the token’s particular place in a global setup. 

The principle, under other forms and on different scales, is quite 
common. An event, an assertion, a work of art might mean whatever 
it means by virtue of its place in a global system of similar events, 
acts, or works. This is particularly true in the artistic domain. Du- 
champ’s famous urinal refers to the tradition of art exhibitions. The 
reference, though not formal and not explicit, is there. For the 
place of that particular display within the historic network of art is 
what endows Duchamp’s exhibit with artistic meaning. Moreover, it 
would be erroneous to derive this meaning from the urinal itself, as 
if it were an “‘indexical” that indicates different things in different 
contexts; quite the contrary. That object means or connotes, within 
the displaying event, whatever a urinal means or connotes in every- 
day life. Only by retaining its ordinary, undignified status, can it 
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cause the surprise, the outrage, and the joke that constitute the 
aesthetics of the display. 

We can now take stock and assess the extent to which pointer 
semantics departs from a type-based one. We can use as a sort of 
measure the extent of token—or pointer—dissent caused by the 
functioning of ‘true’ (and not by indexicals, or ambiguities of sorts). 
In this respect, the formal system of the present proposal provides a 
good view, as it excludes indexicals, or any kind of ambiguity. Now, 
the results LPD and HLD mentioned in the “Preview” imply a very 
simple picture, perhaps surprisingly simple—given that it holds for 
full first-order logic and arbitrary pointer networks. Whenever 
pointers dissent, one of them has the value GAP and the other, 
which has a standard value, is one level higher in the linguistic hier- 
archy. Informally, we can say that departure from type-based se- 
mantics occurs just when a pointer fails to express that which an- 
other at a higher level succeeds in expressing.** Thus, departure 
takes place when a necessary ascent in the linguistic hierarchy can- 
not be accommodated within Tarskian semantics or any of its off- 
shoots—and here I include Kripke’s system and all its variants— 
thereby forcing us to change the language. To encompass it within a 
single language, which is what the present framework does, requires 
a price and the results show that we are paying the minimum. 

While the semantic paradoxes have focused a great amount of 
research, resulting in an extended and often sophisticated litera- 
ture, they have not been at the center of the research program of 
natural-language semantics. This is understandable, given the tangle 
of problems encountered in the rather elementary stages of the 
investigation. To give the semantics of ‘true’ while we are far from 
clear on that of ‘event’, or that of mass terms, might seem a luxury.“ 
But the tendency to regard the problems raised by the paradoxes as 
something esoteric, to be handled later (if at all) when more basic 


“ In this respect, all such cases are like the simple two-line example. But the 
sentence in question can be as complex as any first-order sentence and the failure 
can be caused by an extremely intricate loop. Moreover, it can occur at any level, 
fae, Srna ok inlet) an be ok any cardinality ops ‘exceeding that. of the cota) 
netwo 

“ I should add, however, that it 1s conceivable that in more sophisticated vari- 
ants of the algonthm LPD will fail I see no reason why LPD, pleasing as it is, 
ea en rtd GD a eg procedures in eral. With 
indexicals present, different to of the same sentence may get different non- 
GAP truth values. That a similar phenomenon may arise because of deeper rea- 
sons, deriving from the use of ‘true’, is not to be ruled out. 

“ Using the concept of truth in order to give semantics in the form of truth 
conditions, as was proposed by Davidson, is o course quite different from giving 
the semantics of ‘true’ itself. 
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aspects have been clarified, is, I think, an error. We are not con- 
cerned here with marginal brain teasers, but with an essential fea- 
ture of language. The failure of compositional semantics (where 
“composition” is intended for sentence parts) is highly significant. 

Note also that ‘true’ is the paradigm of a network-creating predi- 
cate. It can send us, via ‘what she said’, ‘whatever he asserted’, ‘what 
is written on page 1’, and their like, from John to Joan, from Joan to 
Jack, to Jill, to McX, and so on; the tour is unrestricted and some- 
times endless. ‘Know’ (in the sense of ‘‘know that’’) and ‘believe’ (in 
the sense of “believe that”) have similar potential: ‘I know what she 
knows’, ‘what he believes I doubt’. But ‘true’ is the more fundamen- 
tal.*® It is no wonder that problems arising out of this chain-forming 
capacity have surfaced in the theoretical research of artificial intelli- 
gence. Once it is realized that knowledge, or belief, are to be mod- 
eled as predicates rather than as sentential operants,** the semantics 
of ‘true’ becomes relevant in the contexts of knowledge and belief 
representation. This can be seen in quite a few works published 
mostly in the volumes of TARK. Thus, L. Morgernstern*’ and M. 
Kremmer® draw on Kripke’s model, while N. Asher and H. Kamp*® 
follow the constructions of Gupta and Herzberger. Among other 
works motivated by problems of self-reference in epistemic contexts 
are those of Asher, R. Koons,®! D. Perlis,” J. des Rivieres and H. 
Levesque,” and R. Thomason. 


* Note how often ‘know’, and to lesser extent ‘believe’, ride on the back of 
‘true’. We might say ‘I know that what she says is true’, but we cannot say ‘I know 
that what she say PAIE EEO o a ee e 
Knowledge P y which are used in formal systems, are rendered 
into natural language iyi ‘true’ or an equivalent expression: ‘J knows that 
x is true’ or ‘J t x is the case’. 

* Sentential operants are too restrictive. ‘John knows something that Bull does 
not know’ is inexpressible if ‘know’ is treated as a sentential operant. 

aD iis First Order Theory of Planning, Knowledge, and Action,” in TARK, J 

rn, ed. (Los Angeles: Morgan Kauffman, 1986), pp. 99-115 
wae and Truth, Ph.D. Dissertation (University Cf Pitts 1986). 

ite Knowers Paradox and Repr aeann. Diecnes a Attitudes,” ın 

TARK (1986), pp. 181-48 

“Reasoning about Belief and Knowledge with Self-reference and Time,” in 
TARK (1988), pp. 61-81. Also, “Intentional Paradoxes and an Inductive Theory 
of Propoaidonal Quantiseaton” in TARK, R. Pankh, ed. (Los Angeles: Morgan 
Kauffmann, 1990), pp 11-28 

51 “Doxastic Paradoxes without Self-reference,” in TARK, M. Vardi, ed. (Los 
Angles: Moran Kauffmann, 1979), pp. 29—42. 

i with Self-reference I: Foundations,” Artificial Intelligence, 
XXV (198% 801-22. 

8 “The Consistency of Syntactical Treatment of Knowledge,” in TARK (1986), 
pp. 115—130. 

M “Paradores and Semantic Representation,” ın TARK (1986), pp. 225-39. 
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IV. APROPOS PROPOSITIONS 

Some would have it that truth and falsity attach to propositions (or 
statements)—entities from which linguistic particulars have been 
abstracted away. There is, however, no agreed view on the nature of 
these objects and the difficulties surrounding the concept may in- 
cline us to consider, at least provisionally, less problematic crea- 
tures. In any case and however propositions are conceived, the cru- 
cial question is that of relating them to linguistic constructs—to 
sentences, for example. On the present view, the question takes the 
form: What proposition, or statement,” is expressed by a pointer? 

I have suggested in the “Preview” that, prima facie, a pointer can 
be taken to express whatever is expressed by the sentence it points 
to—except for failures, marked by GAP. On this suggestion, non- 
failed pointers to the same sentence express the same thing. Since 
(by LPD) such pointers get, invariably, the same truth value, we will 
not run into trouble, as far as truth values are concerned. The plau- 
sibility of the suggestion rests, however, not on LPD, but rather on 
its proof, which shows the following. Whenever p and q are non- 
failed pointers to the same sentence, their evaluation requires the 
same sequence of necessary steps, ending at a stage where each can 
be assigned the same truth value by applying the same rule.” This 
indicates that an account that correlates propositions with sentences 
could be extended to pointers. The heuristics would be as follows. 
Two non-GAP pointers express the same proposition, inasmuch as 
this can be said of the sentences they point to. 


“There ıs considerable overlap in the use of the two terms and I am not 
altogether certain as to the “right” one. There is a usage, which I do not follow, 
whereby propositions are taken as the meanings of declarative sentence types, 
including indexcal ones By contrast, statements are attached to particular utter- 
ances In the tradition deriving from Frege and Russell, however, the proposition 
expressed, say, by ‘it ıs now raining’ changes according to time. Read thus, propo- 
sitions can be associated with sentence types only after converting the sentences 
into “eternal” versions What is then the difference between this notion of propo- 
siton and the notion of statement as advocated by J L. Austin and Peter Straw- 
son? It appears that propositions are prelinguustic entities, existing in their own 
night, whereas a statement ıs something one makes by producing a linguistic to- 
ken. Put bluntly, if there were no people, there would have been no statements 
but there would have been propositions. Stripping them of their more metaphysi- 
cal aspects, one can say that propositions are possible statements. And this is how 
I propose to view them here. I have opted for ‘proposition’, because it is usually 
this term which is employed when a formal treatment is intended. Thus, Barwise 
and Etchemendy (op. cit.) propose a formal model of so-called Austinian proposi- 
tions for what Austin, who rejected ‘proposition’, called statements Indeed, if 
propositions are ible statements, the general theory of statements is the 
theory of propositions. 

5 Were we to use another version of the evaluation procedure, a similar analy- 
ms of the procedure would be required in order to justify stipulations about 
“expressing the same thing ” 
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The propositions to be gotten along the suggested lines are “‘onto- 
logical” rather than “epistemic.” A person stating what to him is a ` 
perfectly clear thought might fail to express a proposition, because 
unwittingly he has involved himself in some loop. This is similar to 
the view obtained from some current accounts of propositions. On 
these accounts, certain terms (the so-called directly referring ones) 
introduce their very references as components of the proposition.®” 
Mistaken beliefs about reference may lead to a person’s expressing a 
proposition he had not intended. And when a directly referring 
term fails to refer, no proposition is expressed. 

But nothing prevents us, at least not in principle, from consider- 
ing other kinds of propositions aimed at reflecting finer shades of 
meaning. Just as, for various purposes, it is natural to hide the refer- 
ence-picking machinery of certain terms, it might be desirable, for 
other purposes, to expose it in the proposition. What propositions 
are depends on what dimensions of meaning they are supposed to 
reflect. The question what are the right propositions is as meaning- 
ful as the question what sort of items a map should display, or what 
is the right granularity of a film. 

A move to more epistemic propositions may entail distinctions 
between various gap pointers. The gap constituted by the line-1] 
sentence derives from a contingent fact: the place where that sen- 
tence is written. Someone might have put it there because of a 
mistaken belief about line numbers.™ By contrast, the loop pro- 
duced by ‘What I am saying at this very moment is not true’ cannot 
be amended by moving it, and other utterances, around. Assuming 
that the speaker knows the language, it is difficult to see what state- 
ment of fact he might have intended. The present formalism does 
not distinguish between the two; both are construed as a pointer, p, 
which points to “7r(p). The distinction can be introduced at the 
cost of additional structure: for example, a set of possible pointing 
functions in addition to the actual one.” 

Finally, it can be argued that some meaning is expressed even in 
the case of noncontingent gaps. For a pointer provides a prescrip- 


57 CE. lan, ‘‘Demonstratrves. An Essay on the Semantics, Logic, Metaphys- 
ics, and Epistemol of Demonstratives and Other Indexicals,” in Themes From 
Kaplan, J. Aim aan a and H. Wettstein, eds. (New York Oxford, 1989). 

In an example of Prior, used by Burge, a student who thinks he 1s ın room 9 
writes on the blackboard: ‘Whatever 1s written on the blackboard in room 10 is 
E Hg Sep p thereby his true belief. It turns out that he himself is ın room 

a Family of Paradoxes,” Notre Dame Journal of Formal 
Lage, T, 1 rT 961): 16-32.) 
Distinctions of similar nature, between kinds of pointing, are used in certain 
more sophisticated variants of the evaluation procedure, aimed at reducing black 
holes 
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tion for computing truth values. It amounts to a kind of algorithm. 
And as an algorithm it makes sense, even when it leads inevitably to a 
nonterminating computation. 

While not endorsing, at least initially, any particular position con- 
cerning propositions, the present proposal incorporates a certain 
methodology: ‘consider propositions after having clarified suffi- 
ciently the rules and mechanisms of truth-value assignments. Propo- 
sitions—if and when we introduce them—can be, then, tailored to 
fit the patterns we deem important. To do otherwise is to commit 
ourselves to preconceived notions, which can be interesting and 
fruitful in their domain of origin, but ill-matched to the issues 
at hand. 

An opposite approach underlies the work of Barwise and ‘Etche- 
mendy (cf. fn. 15), some aspects of which I shall now discuss in brief. 
Barwise and Etchemendy propose formal models of propositions, 
constructed according to situation semantics. They classify their 
propositions into Russellian and so-called Austinian ones, with clear 
preference for the latter. One of their goals is to include in the 
model cyclic propositions: those involving self-reference. And their 
ingenious idea is to employ for that purpose AFA (set theory with 
Aczell’s antifoundation axiom)—a theory of non-well-founded sets 
which accommodates, naturally, a variety of “cyclic creatures,” for 
example, a proposition P such that P =‘not-P. 

While the BE (Barwise-Etchemendy) framework assigns truth val- 
ues to propositions, it does not provide guidelines, let alone a pro- 
cedure, for assigning truth values to sentences—either types or to- 
kens. For when it comes to determining the proposition that is ex- 
pressed by a given utterance, a kind of free parameter enters. In 
cases of self-reference there is considerable latitude; much is up to 
the interpreter, who can toggle truth values by suitable adjustments 
in what he can choose. Therefore, such utterances are construed as 
ambiguous. Presumably, Barwise and Etchemendy regard the prob- 
lem of disambiguation (which must be addressed if we are to deter- 
mine truth values) as belonging to some branch of pragmatics. 

It can be shown that the assignment of truth values according to 
the principles of pointer semantics is outside the scope of the BE 
framework. For the needed distinctions cannot be made in AFA. 


al Namely, the sttuation—in the technical sense of situation semantics—asso- 
ciated with the utterance. A situation can be very roughly described as a collection 
of facts. But the complete circumstances of the utterance—including, if you wish, 
the whole world history—fall short of constituting a complete situation For the 
truth or falsity of some arcular proposition is itself a “fact” that the interpreter 
may, or may not, include in the situation. 
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For example, AFA cannot provide for distinguishing between the 
sentence tokens on lines 1 and 2 in the two-line puzzle." 

Can we correlate sentence tokens with propositions of the BE 
kind, whose truth values are the values derived by pointer seman- 
tics? To a certain extent this is possible. It may work in general, 
but it requires the removal of certain restrictions underlying the 
BE modeling of propositions.®* Perhaps these are artifacts of the 
modeling. 

The major nontechnical point of conflict between the BE frame- 
work and pointer semantics is the former’s failure to make place for 
the notion of a failed utterance. It provides the liar sentence and its 
kin with circular propositions, which I do not find objectionable— 
as long as we mark certain propositions by GAP. But, in the BE 
framework, propositions are either true or false. When P = not-P, 
the proposition and its negation are accorded F; which fact is ex- 
plained in BE by the special modeling of negation—clearly not the 
classical concept. Under a different negation concept, proposed in 
BE and called denial, both a proposition and its denial can be true. 
Whatever the insights revealed by these nonstandard negation con- 
cepts, their service in disguising gaps is negative. A picture that fails 
to distinguish certain types of failure can be quite misleading. It is 
also at odds with Austin, who devoted much effort to the taxonomy 
of failures and who distinguished failures caused by illegitimate self- 
reference from mere self-contradiction.® 

V. THREE PREVIOUS ACCOUNTS 
I shall discuss three accounts related to my proposal, those of Par- 
sons (which can be seen as a development of a theme in Russell), 


*! The two nodes in figure 2 of the “Preview” represent in AFA the same set, 
namely, the unique set containing | as its sole member This is forced by the 
extensionality axiom 

e The ition of disjuncte propositions 1s such that all the disjuncts are 
about the same situation—which is also the situation the disjunction is about. A 
similar restriction applies to conjunctions. This has some problematic conse- 
quences. Consider a disjunction $, V S,, where S, 1s a liar sentence and Sy is a 
sentence saying that S, is not true. If both disjuncts must be about the same 
situation, the second disjunct, and along with it the whole disjunction, come out 
nontrue. In pointer semantics this disjunction (..e., a token, or some pointer to it) 
gets the value 7, because the second disjunct does 

© “Those of the so-called ‘logical oxes’. which concern ‘true’ and 
‘false’ are not to be reduced to cases of self-contradiction, any more than ‘S but I 
do not believe it’ is. A statement to the effect that it is itself true is every bit as 
absurd as the one to the effect that it is itself false There are other types of 
sentence that offend against the fundamental conditions of all communication in 
ways distinct from the way in which “This is red and this is not red’ offends .. .” 
Austin, “Truth,” in Philosophical Papers (New York: Oxford, 1961). Fn. in the 
1970 paperback ed, p 129. 
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Burge, and Skyrms.™% These accounts do not assign truth values to 
sentence tokens, but they share with the present work the view that 
places the strong liar and its kin at the heart of the semantic para- 
doxes. Needless to say, I have no intention to review the vast litera- 
ture devoted to the liar. The choice of these works is dictated by 
their relevance to my proposal and reflects no attempt at ranking. 


Ambiguity and interpretation schemes. Parsons proposes to view 
the liar and its kin as cases in which different interpretation 
schemes, based on different truth concepts, are applied to the same 
sentence (type). No formal explication is given to ‘‘interpretation 
scheme,” but a direction is suggested by rewriting ‘A is true’ as 
‘there exists a proposition P such that A expresses P and P is true’. 
The interpretation scheme is then determined by fixing the domain 
of propositions over which the quantified ‘P’ ranges. The idea goes 
back to Russell, who suggested that the paradox arises from an at- 
tempt to quantify over “too many propositions” and is to be re- 
solved by restricting the quantifier ranges. Such a solution underlies 
Russell’s theory of types. Since there are many natural domains that 
can serve as the range for the propositional variable, the sentence 
turns out to be ambiguous. But whatever the domain, it cannot 
include the proposition expressed by the liar sentence itself. 

Intuitively, the interpretation scheme reflects the comprehensive- 
ness of the point of view from which truth-value judgments are 
made. In the case of our two-line puzzle, the sentence on line 1 fails 
to express a proposition, under the first “straightforward” interpre- 
tation; but under a second scheme, which takes into account the 
previous failure, the sentence is seen to be true. Parsons does not 
give an account of how the interpretation is determined, except for 
noting that it is determined by the circumstances of the sentence’s 
utterance. He proposes to regard this ambiguity in the same light as 
the ambiguity surrounding the interpretation of quantifiers or the 
use of ‘say’, ‘mean’, and their like. 

In a 1982 postscript to his paper, Parsons suggests that Burge’s 
indexical conception of ‘true’ can serve to explicate the ambiguity. 


“ Parsons, “The Liar Paradox," Journal of Philosophical Logic, 10 (1974): 
381-412 

Russell, “Mathematical Logic as Based on the Theory of Types,” in Logic and 
Knowledge: Essays 1901-1950, R. Marsh, ed. (New York: Macmillan, 1956), 
pp. 59-102. 

Burge, op cit 

, ‘Return of the Liar, Three-Vahued Logic and the Nature of Truth,” 

American Philosophical Quarterly, vo (1970). 153-61; and his ‘‘Intensional 
Aspects of Semantical Self-reference,” in Recent Essays on Truth and the Liar 
Paradox. 
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But, in the light of the analysis of section III (and of the next sub- 
section devoted to Burge’s work), this would not do justice to the 
intended picture. For the ambiguity of the liar is not to be traced to 
indexical-like shifts in what the ‘true’ that occurs in the sentence 
denotes. If there is ambiguity, it can derive only from different 
points of view that we—the interpreters—adopt from the outside. 

According to pointer semantics, there is no ambiguity. The token 
on line 1 is definitely not true (and not false either); the token on 
line 2 definitely is. But if one were to smudge over the distinction 
between tokens, one could say that the sentence type is ambiguous: 
different instantiations of it get different truth values. One can 
therefore regard different pointers to the same sentence as express- 
ing its meaning under different interpretation schemes. 

There are other systems that can provide formal backups for the 
ambiguity picture. The constructions of Gupta and Herzberger, 
whereby the liar changes its truth value according to the evaluation 
stage, give rise to such a picture. And the more recent work of 
Barwise and Etchemendy, which I discussed above, provides another 
modeling of ambiguity. It is indeed directly related to Russell’s 
ideas, since one of its aims is to model Russellian propositions. 

While these models do not solve the two-line puzzle, they solve or 
illuminate what we might call the one-line puzzle, which is as follows. 
A philosopher contemplates the sentence on line 1, which asserts its 
own nontruth. First he infers that the sentence is not true. Realizing 
on second reflection that the sentence itself “says” just this, he 
infers that it is true after all. He may continue to the third round, 
whereby the sentence becomes nontrue, after which it becomes true 
again, and so on. 


Burge’s proposal. I have already discussed in section II the indexi- 
cal construing of ‘true’. Here I shall confine myself to more specific 
details of Burge’s work. His proposed setup comes in two parts. The 
first consists in a formal language containing a family of truth predi- 
cates, true,,i = 0, 1,..., arranged in a Tarski-like hierarchy, with 
subscripts denoting the levels. Each predicate is to serve as a truth 
predicate for the portion of the language ‘“‘below it,” that is, for all 
sentence types containing truth predicates with a strictly smaller 
index. Yet each predicate takes as arguments all sentence names of 
the language. Because of self-reference there are, for each 1, sen- 
tences that inevitably fail to get a truth, value: neither the sentence 
nor its negation is true,. The idea is to regulate the assignment of 
truth, values so as to approximate, as far as possible, the properties 
required of a truth predicate, while maintaining the predicate’s po- 
sition in the hierarchy. Burge stipulates various plausible axioms; for 


258 THE JOURNAL OF PHILOSOPHY 


example, axioms to the effect that if A is true,, then A, as well as 
true, (A) are true,, for all k > i.% Apparently, the proposed axioms 
are not sufficient.” But this is a technicality that can be amended. 
There is, in fact, a neat generalization of Kripke’s construction to 
languages containing a hierarchy of truth predicates, which achieves 
the desired result in a simple straightforward way. It is mentioned 
briefly in my 1983 paper.®” We can therefore assume that truth, 
values have been assigned in a satisfactory way to all sentences and 
all i. Call this language the stratified language. (Note that a liar,, i.e., 
a sentence saying of itself that it is not true,, is not true, and neither 
is its negation. But a liar, is true, when evaluated from the outside, 
via the usual truth definition. Since each true, behaves like an ordi- 
nary truth predicate for the portion of the language below it, a liar, 
is true, for all k > i.) 

The second part of Burge’s proposal is to rewrite the ordinary 
sentences within a given discourse as sentences of the stratified lan- 
guage; this is done by “indexing,” i.e., by attaching subscripts to the 
occurrences of ‘true’. No precise rules are given; Burge sees it as a 
pragmatic question and provides certain informal guidelines.© The 
recourse to informal pragmatics has brought the proposal under 
severe criticism by Gupta. 

The main trouble with Burge’s proposal, however, is not that the 
pragmatic principles for indexing are imprecise, but that the indices 


® Actually, Burge considers various sets of axioms, but this need not concern 
us here. 
© The following is, I think, not derivable: If trus, (A) ıs true, then A is not 


true,. 
“I found it when, independently of Burge’s work, I was considering formal 
languages that incorporate Tarskian hierarchies. Here it is. Start by evaluating all 
sentences by the standard rules, treating each of the predicates true, as a truth 
i . The standard rules consist pete etal (Tarskian) truth conditions 
as in Kripke’s construction, via Kleene’s strong three-valued logic) as 
as the following rule If A is true,, then trwe,(A) is true, for all z (mcluding the 
case j < il). Having done this, extend the resulting valuation as follows. For each 
A which is not trues, declare the sentence Ttrue,(A) to be true,, for alli > 0 Now 
E faroe age cls 5 ME lie abba rulet. kat only ta ihe ears 
predicates on level > 0. Thus, by the end of the first stage the extension of tru¢g is 
determined. At the end of the second stage, for each A which is not true,, declare 
“true,(A) to be true, for all: > 1. Again, repeat the process for all predicates at 
level > 1. And so on, until we run out of truth p es 
“A principle called “verity” instructs us to assign indices in a way that maxi- 
muzes as pane ee truth-defining schema, that is, in a way that mini- 
mizes gaps. A less important principle called “justice” says that the aig, ep 
should not prefer one statement to another, unleas for some good reason. Thus, if 
each of two persons asserts of the other that he is not saying the truth, both 
should be given—for reasons of symmetry—the same subscript, resulting in both 
being gaps Either principle is subject to a ceteris paribus clause 
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do not do the desired work. An indexing can only lead to truth, 
values; it does not determine at which level a given sentence is to be 
evaluated: Which true, are we employing when we say of a sentence 
A—written already in indexed form—that it is true, or not true, or 
false? Burge invokes here what he calls a pragmatic implicature: 
consider the evaluating sentence—the sentence asserting the truth, 
or nontruth, of A; index ‘true’ in that sentence. That index is the 
level at which A is evaluated. This, however, is only the beginning of 
the story; for occasionally the implicature is canceled in favor of a 
different one. For example, the sentence on line 1 (in the two-line 
puzzle), which asserts its own nontruth, is the evaluating sentence 
for itself. Suppose the ‘true’ there is indexed by i. By the implica- 
ture, when asserting that the sentence is not true, we are—in fact— 
asserting that it is not true,; which, indeed, it is not. But the same 
implicature rules that when we evaluate our assertion, we do so 
again on level i. Therefore, our assertion, expressed by the sentence 
on line 2, is nontrue as well. This unwelcome result is avoided by 
having the implicature cancelled and by raising the level of the sec- 
ond evaluation to some k, greater than i. As Burge sees it, the move 
from line 1 to line 2 signifies a switch from one implicature to 
another. The change of truth value is only apparent: both are true,, 
and both are not true;. 

What Burge fails to observe is that the resort to implicatures voids 
the indexing of its significance. Not only the liar, but all problematic 
cases of self-reference, require implicature switches. Thus, in the 
Plato-Aristotle example quoted from Buridan (cf. section III), the 
‘true’ in both sayings gets the same index, resulting in both being 
nontrue. If a third party were to assert that Plato’s, or Aristotle’s, 
saying is not true, Ats saying would be true. But this can be sustained 
only if the implicature employed in judging the first two assertions is 
changed when judging the third. Or take Nixon-Dean example, 
where each says of the other that all his utterances concerning Wa- 
tergate are not true. If all other Watergate assertions made by each 
are not true, a loop is created and we can say that these two asser- 
tions are not true. Again, the implicature must be changed when 
judging our assertion; because when Nixon’s and Dean’s sayings are 
spelled out as conjunctions, our assertion is seen to repeat a con- 
junct from each. 

In fact, whenever the jump rule is used in the evaluation proce- 
dure, an implicature must be canceled in favor of a different one 


© The example used by Burge is somewhat different, but the difference does 
not matter here. 
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that shifts the level of ‘true’ in the evaluating sentence. Since the 
shifts are cumulative, a count must be kept to determine the “right 
implicature” for choosing the level; and the level can be any finite or 
infinite ordinal. Burge ignores all this. Having introduced implica- 
ture in the analysis of the liar, he goes on in the second half of the 
paper to discuss the stratified language and the indexing, without 
mentioning that matter again. The process of implicature shifts is 
altogether obscure; no guideline, nor any indication, is given. But 
there lies the real problem. For knowing the mechanism that gov- 
erns implicature choice is almost like knowing the evaluation pro- 
cedure. It is this mechanism which does the real work. Indexicality 
rests idle. 


The intenstonal conception. The analogy between an intensional 
case and a strong liar, on which Skyrms draws, is as follows.”° That 
the sum of 7 and 2 is odd is necessary, that the number of planets is 
odd is not—even though the sum of 7 and 2 is the number of 
planets; just so, the sentence on line 2 is true, the sentence on line 1 
is not—even though the sentence on line 2 is the sentence on line 1. 
Skyrms proposes to solve the strong liar by construing ‘true’ as in- 
tensional. Truth values are to be assigned to sentence types under 
descriptions. Under the description ‘the sentence on line 2’, the 
sentence is true; under the description ‘the sentence on line 1’, it 
is not. 

Formally, Skyrms proposes to assign truth values to sentence- 
description pairs (s,d), where d is a description of the sentence 
type s, and to do so in a way analogous to Kripke’s fixed-point 
construction. He shows that, given a partial valuation of sentence- 
description pairs (which may distinguish between pairs that differ 
only in the description coordinate), a variant of Kripke’s construc- 
tion can be used to extend it. Skyrms’s construction falls short of the 
desired goal since, being based on (the analogs of) Kripke’s rules, it 
does not have the gap and jump rules, which are crucial. It works as 
long as we are moving through standard values; e.g., if we have 
already assigned (s, d) the value T, we can go on and assign (Y, d’) the 
value F, where s is the negation of s and ď is some appropriate 
description obtained from d. But it does not provide a way for as- 
signing T to the token on line 2 (i.e., to the pair (s, d), where s is the 
sentence and d is the description ‘the sentence on line 2’). In gen- 
eral, it falls short whenever the jump rule is invoked. 


™ I use our two-line puzzle, the difference between it and the example used in 
Skyrms does not matter here. 
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Now, if desired, one can regard the pair (s,d) as a pointer to s. 
And, vice versa, a pointer to s can be changed to a pair (s, d) where d 
is some description derived from the pointer. Formally, my algo- 
rithm can be viewed as providing a truth-value assignment to sen- 
tences under descriptions, achieving thereby Skyrms’s goal.” I do 
not see, however, how the failure of substitutivity arising in the 
strong-liar puzzle can be accounted by intensionality, without 
stretching ‘“‘intensionality” so as to render it uninteresting. For, by 
the same token, one could subsume under it all other context- 
dependent phenomena, including, for example, indexicality. When 
each of Jack and Jill says, “I am tal,” but only Jack is tall, we could 
explain the truth-value difference by the intensionality of ‘true’, 
namely, ‘I am tall’ is true, under the description ‘the sentence ut- 
tered by Jack’, but not true under the description ‘the sentence 
uttered by Jill’. I doubt that we want to adopt this picture. 

Intensionality is affected by the way in which referring terms pick 
their references. The sum of 7 and 2 must be the number 9, but it is 
only a cosmic accident, or 80 we think, that 9 is also the number of 
planets. But nothing in the liar situation hinges on some unclarity in 
the referential picture. The intensional effect disappears if the refer- 
ential connection is made explicit. Given that the number of planets 
is the sum of 7 and 2, it is necessary that the number of planets is 
odd. Or, given that George IV knew that the author of Waverly was 
Scott, it is inconceivable that he wondered whether Scott wrote 
Waverly. But can we say that, given that the sentence type on line 
2 is the sentence type on line 1, the sentence on line 1 is true? Cer- 
tainly not. 

Viewed in retrospect, both conceptions of truth—as an indexical, 
and as an intensional notion—are attempts to account for phenom- 
ena arising out of liar-like paradoxes by using tools that fall essen- 
tially within the framework of traditional logic. Both come short of 
providing a solution, but for different reasons. 


HAIM GAIFMAN 
Columbia University 
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IS IT REASONABLE TO REGRET THINGS ONE DID?* 


' J eall know regret. Whether we take ourselves to be moral 
W ress or immoralists, and, if the former, whether we 
have an Aristotelian, Kantian, or utilitarian understand- 
ing of morality (or no clear understanding at all), sometimes we 
regret what we did. The object of the following is not to demarcate 
what it is to regret things one did, to describe how it feels, or to state 
the Pree conditions under which regret does and does not ap- 
pear.’ A rough explanation should suffice: regret is a painful feeling 
about something we did which we think was bad. More specific kinds 
of phenomena falling under this explanation, like remorse, repen- 
tance, feeling of guilt, will not be discussed in particular. Nor is the 
diversity of reasons for thinking it was bad of interest here: one can 
regret a purchase because it was too expensive, one can regret hav- 
ing behaved incorrectly because it diminishes one’s standing, one 
can regret a mean remark because it hurt the other person; though 
it is the last kind of case, regret over something taken to be wrong 
for moral reasons, which will be in the foreground in what follows. 
The question is this: Given that we did something bad, is it reason- 
able for us to regret that we did it? The word ‘reasonable’ is to be 
understood here in its everyday sense, roughly equivalent to phrases 
like ‘it’s a good idea’, ‘it makes sense’, ‘it is recommendable’. Thus, 
the question can also be expressed as follows: Should we regret bad 
things we did? 

It will be wondered whether the question itself is a reasonable 
one. After all, you feel regret, and what you feel is not up to you. To 
ask whether something is reasonable, however, makes sense only if it 
is up to you to do it or not. So the question whether regret is 
reasonable is mistaken right from the start. You might as well ask 
whether it is reasonable to have a headache. 

This raises a large problem: How is what we feel related to the rest 
of our mental life, and how is it related to action, in particular to 


* For helpful criticism of revious versions of this text I am grateful to Celéne 


Abramson, Thomas P e : Rorty, Nancy Schauber, and an audience at 
the Unversity of of Bielefeld, German 

! See Amélie O. Ro eg ee in Rorty, ed , Explaining Emotions 
(Berkeley: California ), pp- 489-506; Gabriele Taylor, “Guilt and Re- 


morse,” in Taylor, Pride, home and Guilt: Emotions of Self-assessment (New 
York: Oxford, 1985), pp. 85-107. 
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moral action?? Without addressing this large question, a preliminary 
answer to the objection at hand may be given as follows. To ask 
whether something is reasonable makes sense not only in those cases 
where doing it or not is up to you in the sense that you can do it 
right on the spot. It is true that in this sense it is not up to us to 
regret things we did. We cannot at any given moment turn off the 
regret we feel about something we did, nor can we turn on a regret 
we do not feel. Yet over longer periods of time we learn to feel a 
certain way, and we learn not to feel a certain way any more, on this 
or that sort of occasion. This holds for individuals and for groups of 
people. For example, a feeling of reverence has passed out of use in 
my life. I did feel reverence for some people when I was much 
younger. Now I do not; and it seems this is due, not to a scarcity of 
suitable objects, but to my having outgrown this feeling altogether. I 
have lost this language; I do not just fail to find people with whom to 
speak it. Similarly, over the last few centuries people in the Western 
world have lost the feeling of awe. It still has a prominent place in 
Hobbes (and there is no reason to think that he is archaic on pur- 
pose here), but it has dried out since. The massive violence of the 
twentieth-century state inspires terror, not awe: there is nothing 
sacred about it. Now, it would be a long story, both in the individual 
and in the larger case, to explain how it happened that these feelings 
fell into desuetude. But it is very unlikely that a convincing story 
could be told, in either case, without referring to what people came 
to think, to see, to recognize in the process. People do not learn, or 
unlearn, to feel something just by being pushed around; it is also a 
matter of what they think about what happens to them.’ For exam- 
ple, it was precisely Hobbes’ own construction of an awe-producing 
political mechanism which, among numerous other influences, con- 
tributed to awe’s becoming obsolete. And it is clear that I did not 
lose reverence for people like a key from my pocket. It was part of a 
whole process in which I came to have a different view of people 
who were important to me, and of myself, too. So in this sense some 
of our feelings are indeed up to us: whether some kind of feeling is 
available in a situation or not depends on what we have become and 


? There is a pe literature on these topics. To mention only a few titles Rorty, 
. cit.; William Lyons, Emotion (New York: peas 1980), Ronald de Sousa, 
tonality and the Emotions (Cambridge: MIT, 1987); Patricia S. Greenspan, 
Emotions and Reasons: An Inquiry into Emotional Justification (New York. 
1988). 
* The point here is the tive version of Epictet’s famous line that what con- 
fuses people are not the thi but what they think about the things. 
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how we see things. In particular, then, it will depend, among other 
things, on whether we judge that feeling to be reasonable or not. 
Again, it is not that a feeling judged to be unreasonable is thereby 
and in that moment annihilated.* What sometimes happens is rather 
that a feeling judged to be unreasonable slowly recedes and eventu- 
ally dies out, just because it makes no more sense to us. To be sure, 
this happens only sometimes: some feelings live on stubbornly in the 
face of better judgment. But sometimes they yield. Feeling is not in 
principle insensitive to insight. Perhaps it is here that the moral 
philosopher’s task lies. He is not particularly effective in reforming 
action right on the spot in accordance with better insight. But he 
may, in the longer term, help people in learning to find sense, or to 
find no sense, in what they feel. Thus he may help in reaching a 
more sensible way of feeling and so in the end, perhaps, a more 
sensible way to act. 

Spinoza raised the suspicion in his Ethica that to regret what one 
did is not reasonable: ‘‘Repentance is not a virtue, i.e. it does not 
arise from reason. Rather, he who repents what he did is twice 
miserable, i.e. impotent” (pt. IV, prop. 54). It is not the claim ex- 
pressed in this sentence, however, which is under discussion here. 
That Spinoza speaks about repentance rather than regret does not 
make an important difference: his point, if he has one, applies to the 
latter phenomenon as well. The important difference lies in the 
concept of reason. For Spinoza, reason consists in the possession of 
adequate ideas about how things are." This is a much narrower con- 
cept than the everyday notion indicated above. His claim that repen- 
tance, or regret, is not reasonable thus becomes uninterestingly 
weak. That we do not feel regret insofar as we entertain adequate 
ideas about ourselves and people and things around us, that is, inso- 
far as we consider everything from the viewpoint of eternity (ibid., 
prop. 44, corollary 2). is easy to grant. The interesting question is 
whether regret has a place in the broader field of what a reasonable 
human being does and feels, with ‘reasonable’ being understood in 
its ordinary sense. Spinoza’s proposition does not address this ques- 
tion, but invites the reader to ask it. 

Moreover, Spinoza suggests an argument for a negative answer. 
‘To regret what one did is to be twice miserable’ means: a person is 


t Hume seems to be supposing something as strong as this. “The moment we 
percerve the falsehood of any supposition, or the insufficiency of any means our 
ems to our reason without any o ition”; A Treatise of Human Na- 
ture, L. Bigge and P. Nidditch, eds (New York: Oxford, 1978), p. 416. 

® Ethica, Pt. II, prop. 40, scholium 2 
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first miserable because he did something bad, and he is miserable a 
second time because he is in pain over what he did. “To be miser- 
able’ must be understood here in an objective sense, not as ‘feeling 
bad’, but as ‘being in a bad way’. This is so because the first misery is 
not a case of feeling bad. It is not, for otherwise it could not be 
distinguished from the second one. Rather, feeling bad is one kind 
of being in a bad way, which is why both the first and the second 
misery can indeed be called “misery.” (Spinoza hints at this in add- 
ing the phrase ‘i.e. impotent’.) What is under discussion is the state 
of an organism that is objectively ‘low’. Still, one way for an organ- 
ism to be low is to feel low. So what Spinoza is saying is this. The 
person who regrets what he did is in a bad way twice. Whoever did 
something bad is thereby, independently of how he feels about it, 
badly off. The pain he feels about what he did is for him to be badly 
off a second time. Thus, Spinoza’s argument is: it is not reasonable, 
because one did something bad, to go and make things worse. But 
that is what regret is, double misery, the second for the sake of the 
first. So, regret is not reasonable. The same argument, incidentally, 
occurs in Nietzsche: “Never give room to repentance, but tell your- 
self at once: this would mean to add a second piece of stupidity to 
the first.’”® 

The argument also gives a clearer idea of the nonregretting per- 
son Spinoza has in view. It is not because this person never goes 
wrong or never admits to having gone wrong that he does not re- 
gret. Accordingly, Spinoza is not uselessly advising never to have 
grounds for regret. The reasonable person without regret has done 
something bad, and for the sake of the present argument it should 
be supposed that we all agree with him on this. So he is not one of 
those famous people who decline to regret because they do not 
recognize the law under which they appear guilty, from Don Gio- 
vanni’s “No!” when the Commendatore asks him to repent, to Edith 
Piaf’s “Non, je ne regrette rien.” Unlike them, Spinoza’s reasonable 
person is indeed once miserable. He is not immoral, nor is he bound 
to be heartless. He sees that what he did was wrong and he may be 
perfectly aware of the suffering he inflicted on others. He just does 
not grieve. Thomas Aquinas’ held that the saints in the new life after 
judgment will be displeased by their earlier sins, but “without any 


© The Wanderer and His Shadow, 323. A similar sentiment is expressed in 
Ralph Waldo Emerson's essay “Self-reliance,” as quoted by Rorty in “Agent Re- 
gret,” p. 501. 

7 Summa Theologtae, III, qu. 84, art. 8 
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sadness.” These are sensible saints, Spinoza is saying, let us be like 
them, right now in this life. 

The opposite view has been put forward by Bernard Williams: “it 
would be an insane concept of rationality” which insisted that a 
rational person would never experience sentiments of agent regret 
(ibid., p. 29). In fact, here and in other passages Williams has a 
stronger claim in mind than the one under discussion here, and 
what he says is not specifically directed against the Spinozistic view 
just sketched. Williams holds that regret is sometimes appropriate 
not only with regard to intentional actions, but also with regard to 
consequences of action which the agent did not intend and could 
not even foresee, so that the field of potential regret extends to 
“almost anything for which one was causally responsible in virtue of 
something one intentionally did” (ibid., p. 28). Williams holds this 
partly because it allows him a respectable concept of tragedy. It is 
true that tragedy presents us with extreme cases.’ Still, what is cen- 
tral to tragedy is central to our life, too: the acceptance of things 
that just happened as part of what one has done and, thereby, of 
what one is.'° Neither accident nor the machinations of the gods 
lead Oedipus into disaster, but rather what he did. It is what he did, 
for in the important sense “it was him,” even though he did it 
unwittingly and unwillingly. This, evidently, is a far-reaching and 
quite doubtful thesis. Williams asks rhetorically: “Could we have, 
and do we want, a concept of agency by which what Oedipus said 
[namely, that he did not do it] would be simply true, and by which he 
would be seeing things rightly if for him it was straight off as though 
he had no part in it?” (ibid., p. 30 fn.) But that is just to frighten the 
children. The answer to this question should simply be, “yes.” 
Agency requires intentionality. In any case, what is under discussion 
here is not the strong thesis that regret is in order even with respect 
to unintended consequences of actions, but the weaker one that 
there are some cases where regretting what one did is reasonable. 
Given Spinoza’s challenge, the thesis needs reasons. 

Williams does not state his reasons explicitly, so an argument has 
to be constructed. The most natural consideration may be this: to 
regret something bad that one did is reasonable because it makes it 
more probable that one will do better in the future. There may be 


$ “Moral Luck,” in Moral Luck (New York: Cambridge, 1981) 
° Willi Ethical Consistency,” in Problems of the Self (New York: Cam- 
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IS IT REASONABLE TO REGRET? 267 


some hints in this direction in Williams’s texts,!! but it does not seem 
to be the reason on which his claim is based. Nor would it be a good 
reason. Regret does not in fact make doing better in the future 
more probable. The impression that it does arises from a confusion 
of Spinoza’s reasonable person who does not regret with an immoral 
or heartless one. No doubt somebody who does not see anything 
wrong in what he did or has no sense for the suffering he caused is 
likely to do similar things again. That is not the case of the reason- 
able person who does not regret, however. He understands what he 
did, he knows that it was bad, it just does not pain him. And there is 
no reason to assume that it is simply being pained by what one did 
which makes doing better in the future more likely. Failure to learn 
from what went wrong comes in many forms. In moral matters there 
is the self-complacency of feeling terrible about what one did and 
leaving it at that, as well as the impassiveness of going on with busi- 
ness as usual. There are cases in all four fields of the two distinc- 
tions: there is regret with improving and without improving, and 
there is absence of regret with improving and without improving. 
That the ratio of improving to not improving is higher under re- 
gret than under absence of regret is not a plausible or justified 
assumption. 

Second, regret could be defended with the idea of atonement. He 
who did evil must suffer evil, otherwise things will not return to 
order. Since the normal course of the world cannot be relied upon 
to make this happen, it is a good thing for the agent himself to pay 
for what he did by grieving about it. If he only registers that what he 
did was bad and even does everything he can to prevent his doing it 
again, he seems to get off the hook too easily. This leads to the idea 
of a regret that cleanses the heart: sincere grief can balance, and 
thereby remove, the agent’s guilt. In this way, regret promises libera- 
tion. Suffering through what he did, the agent becomes free of the 
burden of the deed and can start afresh. 

This defense has no chance to succeed, nor is there any indication 
that Williams would want to use it. Atonement is itself not a reason- 
able idea and cannot show regret to be reasonable. That he who did 
evil should suffer evil is neither required nor even desirable. How- 
ever deeply it may have been inculcated in us that it ought to be the 
case that the good be happy and the evil suffer, the truth is that, if 
anything, it ought to be the case that everybody, good and evil alike, 


n “Ethical Consistency,” p. 175; “Politics and Moral Character,” m Moral 
Luck, p. 62. 
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be happy. The suffering of him who did evil is only a further evil; it 
does nothing to remove the first. That the agent should, if possible, 
repair the damage he caused or recompense for it is another matter. 
Atonement is taken to eliminate, not the damage, but the guilt in- 
curred in causing the damage. But this is a mythical idea. That we 
did what is bad cannot be undone. The promise of liberation 
through regret is a false promise. We should free ourselves, not of 
the burden of the deed, but of the idea that the deed is a burden. 

The arguments considered so far have parallels in the discussion 
of punishment, in the attack on punishment as a mere duplication of 
suffering, in the defense of punishment as a way to make betterment 
more probable, in the defense of punishment as atonement. Not so 
the following consideration, which probably comes closest to being 
the argument on which Williams’s thesis is based. Having made the 
point, as quoted several paragraphs back, that a concept of rational- 
ity which excludes regret over what one did would be an insane one, 
Williams continues as follows: 


To insist on such a conception of rationality, moreover, would, apart 
from other kinds of absurdity, suggest a large falsehood: that we might, 
if we conducted ourselves clear-headedly enough, entirely detach our- 
selves from the unintentional aspects of our actions, relegating their 
costs to, so to speak, the insurance fund, and yet still retain our identity 
and character as agents.!? 


This statement is meant to support Williams’s stronger thesis that 
regret is appropriate even with regard to unintended consequences 
of actions. The reasoning is: not to regret the unintended conse- 
quences of actions is to detach oneself from them, but in detaching 
oneself from them one fails to retain one’s identity and character as 
an agent. The reasoning should apply with all the more force to the 
core area of action, intentional action, which is what is under discus- 
sion here. Not to regret what one did, Williams is saying, is to detach 
oneself from one’s actions, but that is to lose one’s identity and 
character as an agent. 

So the reasoning is not: failure to regret would amount to a de- 
tachment from what one did, but that is impossible, so it makes no 
sense to try. Detachment is possible. What is wrong with it, accord- 
ing to Williams, is only that it is too costly; morally costly, that is. 
You pay detachment with your identity and character as an agent. 


12 “Moral Luck,” p. 29. 
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The present argument, then, like the previous ones, tries to defend 
regret by indicating a benefit of it which is overlooked in Spinoza’s 
challenge. Regret, the first argument said, makes better action in the 
future more likely. That did not appear plausible. Regret, the sec- 
ond argument said, obliterates the wrongdoing. That appeared 
false. Regret, now the third argument is saying, allows us to retain 
our identity and character as agents. This is unclear. We need to 
know three things. What is meant by ‘retaining our identity and 
character as agents’? Why is it desirable to retain them? And why 
does it take regret to retain them? 

The expression ‘our identity and character as agents’ cannot 
mean “our character of being agents.” On this reading, Williams 
would claim that by not regretting one ceases to be an agent alto- 
gether, which is quite implausible. On the other hand, ‘our identity 
and character as agents’ cannot mean what is normally simply called 
“character,” our individual qualities as they are manifested in how 
we act. On this reading, Williams would claim that failure to regret 
leads to the loss of one’s character traits. That again is implausible. 
A generous, courageous, or short-tempered person does not stop 
being any of these things just by not regretting what he did. Rather, 
what is meant must be something like this. We are the agents we are 
not just by having done what we did. We are the agents we are by 
accepting these doings as ours. This is what the person who does not 
regret fails to do. He sheds what he did rather than identifying with 
it. He steals away out of his own deed. He does not doubt that he 
did it; rather, the point is that he does not positively take it on as his. 
What he thereby loses is after all, in one sense and to a certain 
extent, his character of being an agent. He loses it not in the sense 
that henceforth he is barred from agency. He loses it with respect to 
the past. He loses it in the sense that something he actually did gets 
cut out from what he is as an agent, given that what he is as an agent 
depends in part on what he accepts as his. “Where It was, there I 
shall become,” Freud wrote. Where I was to be, there It is—this is 
what failure to retain one’s identity and character as an agent 
means. It is an agent’s crippling himself as an agent, by surrendering 
something he did to the realm of what just happens through him, 
rather than incorporating it in what he is as an agent. So what Wil- 
liams is saying about the person who does not regret is that, in a 
specific sense, he lacks character. Not that he is characterless, i.e., 
bland. He lacks character the way a person does who, on encounter- 
ing any sort of resistance, abandons people and projects to which he 
was committed: the person without regret abandons his deeds. 
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In short, Williams is Polonius: “This above all; to thine ownself 
be true.’ 

Why, really? This is the second of the questions distinguished 
above, about the desirability of retaining your identity and character 
as an agent. If failure to identify with what you did is held to make 
lack of regret morally costly, indeed on Williams’s view prohibitively 
costly, it needs to be explained what is good about identifying with 
what you did. Why not throw your identity and character, in this 
sense, to the winds? There seems to be only one answer. You should 
identify with what you did because you did it after all. It is the truth 
about you that you realize in acknowledging agency. To thine own 
self, expressed in your actions, be true, for that is what you re- 
ally are. 

This is not a good answer, given the terms of the question. If your 
identity and character as an agent is something you may retain or 
fail to retain, then the option of retaining cannot be recommended 
on the grounds that thereby you acknowledge the truth about you. 
About this purported truth you are just called upon to decide, or 
else the question about retaining or not does not arise. If you are in 
the position to retain or not, then you are not in the position to stay 
true to yourself by either retaining or not retaining. “You must 
become who you are,” Nietzsche said,’* but the pathos of truth to 
oneself rests on an assumption, namely, that there is such a self to be 
true to, which it precisely undercuts, namely, by understanding 
truth to oneself as merely one option among others.’® Thus, Wil- 
liams’s warning that by not regretting we fail to retain our identity 
and character as agents turns out to be another case of frightening 
the children. There is nothing detectably wrong with a failure to 
retain one’s identity and character as an agent, in the sense of failing 
to identify with what one did. There is nothing detectably wrong 
with simply walking away from it. To be sure, we are not talking here 
about denying that one did it, in the ordinary sense. The person 
without regret and his opponent are agreed on the facts of what 
happened. The latter only claims that there is, beyond the trivial 
knowing that one was the doer of this deed, an acceptance of one’s 
having done it as part of what one is, as part of one’s identity, as this 


13 Wilham Shakespeare, Hamlet, I 3. 
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is often expressed;'® and that such acceptance is what is required of 
agents. This claim is what is unfounded. We can live, and perhaps 
live better, without identity and character in this sense. Sure, we will 
have identity and character in the ordinary sense. We will be the 
individual agents we are, with these particular features manifesting 
themselves in action. The fact that we have identity and character in 
this sense, however, lends no support to the demand expressly to 
retain identity and character, in the special sense, through identifi- 
cation with what we did. Identity and character in the ordinary sense 
we will have in any case, so we may identify or not with what we did 
as we please. And sometimes it will make sense not to identify, and 
to say: “True, I did that, I even did it intentionally. But I am no 
more in it. You cannot find me there. That is history. It does not 
cast a shadow over what I do now.” No doubt, somebody talking like 
that may be deceived and may, by anxiously denying the shadow, 
rather confirm its presence. The point is that this need not be the 
case. He may truly be free. 

Suppose, for the sake of the argument, that all of that is wrong. 
Suppose, that is, that there is a sense of “our identity and character 
as agents” which makes the demand to retain our identity and char- 
acter neither unjustified nor superfluous. That leaves us still without 
an answer to the third question, namely, why the retaining needs 
regret about the bad things one did. Why should not remembrance 
and understanding suffice, remembrance of what was done and un- 
derstanding of what was bad about it? Why grieve in addition? 

The best answer in defense of regret may be this: there is no 
understanding that the thing was wrong without the grief about 
having done it. This is not the cynical wisdom that “they won’t re- 
member what hasn’t been beaten into them.” The point is: to know 
that one did it and that it was bad is by itself painful. Spinoza may be 
right in saying that he who regrets is twice miserable. What he over- 
looks is the fact that the second misery is linked to the experience of 
the first. Thus, the only alternative to regret is simply to be blind to 
one’s failings, and that this is not a desirable attitude is obvious. 
“Suffering we recognize that we did wrong,” Antigone says.’” The 
thought here is: only in suffering we recognize it. This is not evident. 


18 See again Williams, “Moral Luck,” pp. 29f.: “if one attaches portance to 
the sense of what one is in terms of what one has done and what in the world one 
is responsible for, one must accept much that makes its claim on that sense solely 
in virtue of its bemg actual ” 
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It is not evident that one could not see, in full clarity but without 
grief, what one did wrong. In fact, the contrary may be true: that 
with grief one could not see in full clarity what one did wrong. After 
all, to see such a thing is itself an achievement and, hurt, torn, and 
dejected by regret, we may not be capable of doing it. However this 
may be, the case for the necessity of regret for recognizing one’s 
failing has not been made. 

“Aren’t you really returning now,” it may be objected, “to Spi- 
noza’s understanding of reason, instead of staying with the ordinary 
one, as promised? Full clarity about what one did wrong, unclouded 
by grief, looks suspiciously like considering things from the point of 
view of eternity, not like something which it is a good idea to reach.” 
Maybe this is what Spinoza meant by a consideration of things from 
the point of view of eternity: without pain just to get clear about 
matters. (Though ‘eternity’ speaks against this reading and suggests, 
rather, that this point of view lies beyond the human field.) If he 
meant that, then it is true, we are back with Spinoza’s understanding 
of reason here. But it is not true that this is to quit the ordinary one. 
To look things straight in the face, unburdened by grief, is just a 
very good idea, and a good idea in the same sense in which it is a 
good idea, say, to have the snow tires mounted before the first 
snowfall. True, there is this difference: some people do mount their 
snow tires in time, but few people are likely to take in their failings 
without regret. Still, this shows only that some forms of unreason 
are more common and harder to eradicate than others. As discussed 
earlier, it does not show that this is a form of unreason we are stuck 
with once and for all. 

“Your argument proves too much,” it will be said. “This reason- 
ing can be applied not only to regret of things done, but also to 
regret of things lost and above all to grief and mourning over people 
who are no longer with us, whether they died or went away. Let us 
not quarrel about whether it is possible to do without these emo- 
tions. Let us grant your point for now that a suitable éducation 
sentimentale may in the end eliminate them. But is that a desirable 
state? Imagine a man saying: ‘Yes, it is a bad thing, we broke up, 
even though I had loved her from my heart; but it does not hurt me, 
I am alive and well as ever’. Isn’t this a terribly cold, cruel, inhuman 
reaction? Something must be wrong with an idea of being reason- 
able which recommends such brutishness.” There need not be any- 
thing cruel or brutish here. True, we often take such a reaction 
rightly as evidence that he did not love her so much after all or that, 
if he did, he now has lost, or indeed destroyed, his sense for why he 
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loved her. But it need not be so. We may know independently that 
he did love her deeply and that his present reaction is not owed to 
his being blind to her. Also, a man speaking like this will often be 
speaking falsely, either hiding his pain from others or deceived 
about it himself. Again, this need not be so, however. And if every- 
thing is as he says it is: love, loss, and no wound, then so much the 
better. (“If you were the woman with whom he broke up, would you 
still say that?” Yes, especially then, I hope.) We do grieve about 
separations, but there is nothing recommendable or praiseworthy in 
grieving. Lf you can do without it, do not think you rather ought to 
grieve. 

Or somebody may argue ad hominem: “I gather you are German. 
So obviously you are engaged in a little whitewash. Stripped of the 
philosophical disguise, your message is that Germans should stop 
doing penitence for what they did under the Nazis. We should all 
forget about the past, wipe the slate clean and turn to the business at 
hand as if nothing had happened. Alexander and Margarete Mit- 
scherlich!® lamented an inability to mourn, as they called it, in the 
German attitude to their past. You just give that defect a good 
name.” There is a misunderstanding here. I am saying that peni- 
tence, in the sense of grieving over what one did, and mourning are 
not recommendable attitudes. I am saying that the Nazi who did the 
most horrendous things would still be wise to try, undejected by 
what he did, to do what now is right. To speak against penitence and 
mourning, however, is not to say that the events which penitence 
and mourning are about are any less bad. Nor is it to recommend 
forgetting them. On the contrary, it is with mourning and regret left 
behind that we may come to see what we did in the sharpest light. 
We should not suffer from what we did, we should face it. 

RUDIGER BITTNER 
University of Bielefeld 
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NOTES AND NEWS 


The Department of Philosophy at Rice University is pleased to announce 
the appointment of George Sher as the Herbert S. Autrey Professor of 


The Department of Philosophy at Virginia Commonwealth University is 
pleased to announce the appointment of Eugene Mills as Assistant Pro- 
fessor. : 


The Department of Philosophy at the University of Waterloo is pleased to 
announce the appointment of Paul Thagard as Professor 


Joan C. Callahan has been elected Executive Secretary of the Society for 
Philosophy and Public Affairs. Please address all correspondence with the 
Society to Prof. Callahan, Phil. Dept., Univ. of Kentucky, Lexington, KY 
40606-0027. 


The National Council for Critical Analysis seeks a person or institution 
willing to take over The Journal of Critical Analysts and/or The Journal 
of Pre-College Philosophy. Anyone interested should communicate with 
Dr. Pasqual S. Schievella, NCCA, P.O. Box 137, Port Jefferson, NY 11777. 
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critics. The editor of each volume will write an introduction and prepare an 
extensive bibliography, in addition to selecting previously published essays 
and commissioning new contributions. The series will begin with a volume 
on Plato edited by Nancy Tuana. Other early volumes will focus on Aris- 
totle, Locke, Marx, Wittgenstein, De Beauvoir, Foucault, and Derrida. In- 
quiries should be directed to Prof. Nancy Tuana, School of Arts and Hu- 
manities, Univ. of Texas/Dallas, Box 830688, Richardson, TX 75083-0688. 


The Department of Philosophy at the Virginia Polytechnic Institute and 
State University is pleased to announce that it will be offering a course of 
study leading to the M.A. in Philosophy, beginning in the Fall. Special areas 
of concentration include History of Philosophy, Philosophy of Science, and 
Ethics. The Univ. and Dept take affirmative action very seriously. We are 
especially mterested in receiving applications from women and people of 
color. Please direct inquiries to Graduate Program, Phil. Dept., Virginia 
Tech & State Univ., Blacksburg, VA 24061-0126. 
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INDISPENSABILITY AND PRACTICE* 


or some time now, philosophical thinking about mathematics 

has been profoundly influenced by arguments based on its 

applications in natural science, the so-called “‘indispensability 
arguments.” The general idea traces back at least to Gottlob Frege,’ 
but contemporary versions stem from the writings of W. V. Quine,” 
and later, Hilary Putnam.* Much contemporary philosophy of math- 
ematics (including my own) operates within the parameters of the 
indispensability arguments; they are called upon to motivate various 
versions of nominalism, as well as to support various versions of 
realism. Still, attention to practice, both scientific and mathematical, 
has recently led me to doubt their efficacy. I shall try to explain 
these doubts in what follows. If they are legitimate, we will be forced 
to rethink much of current orthodoxy in the philosophy of mathe- 
matics. 

I NATURALISTIC BACKGROUND 

An argument based on scientific and mathematical practice can 
only succeed from a sufficiently naturalistic perspective. Familiar 
Quinean naturalism counsels us to reject prescientific first philo- 
sophy in favor of an approach that begins within our current best 
scientific theory: 


* My thanks go to the NSF (DIR-9004168) and to the UC/Irvine Academic 
Senate ittee on Research for their support. 

. it is applicability alone which elevates arithmetic from a game to the rank 
of a science,” P. Geach and M. Black, eds., Translations from the Philosophical 
Writings of Gottlob Frege (Cambridge: Blackwell, 1970), p. 187. Although Frege 
speaks of arithmetic, my focus here will be on analysis, b construed 

? See, e.g., “On What There Is,” repr. in From a Logical Point of View, 2nd 
ed. (Cambridge. Harvard, 1980), pp 1-19; and “Carnap and Logical Truth,” 
repr. in The Ways of Paradox, rev. ed. (Cambridge: Harvard, 1976), ee ein 

See, e.g., “What Is Mathematical Truth?” and ‘‘Philoso rig seer ras 
Mathematics, Matter and Method, 2nd ed (New York. ridge, Sars 
60-78, 328-57. 
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... naturalism: abandonment of the goal of a first philosophy. It sees 
natural science as an inquiry into reality, fallible and corrigible but not 
answerable to any supra-scientific tribunal, and not in need of any 
justification beyond observation and the hypothetico-deductive 
method ... The naturalistic philosopher begins his reasoning within 
the inherited world theory as a going concern. He tentatively believes 
all of it, but believes also that some unidentified portions are wrong. 
He tries to improve, clarify, and understand the system from within. 
He is the busy sailor adrift on Neurath’s boat.‘ 


Epistemological studies, in particular, are to be carried out within 
science, with the help of relevant psychological theories. 

From the perspective of this scientific naturalism, a philosopher 
can criticize scientific practice, but only on scientific grounds, as a 
scientist might do, for good scientific reasons. This is enough to 
ratify an appeal to scientific practice in philosophical contexts: be- 
cause scientific practice can only be questioned on scientific 
grounds, a conflict between scientific practice and philosophy must 
be resolved by revising the philosophy. So, for example, if scientific 
practice holds that p does or does not count as evidence for q, to 
disagree on philosophical grounds is an offense against naturalism. 

As we shall see, however, it is not clear that Quine intends to 
extend this naturalistic faith in practice to the practice of mathemat- 
ics. Leaving Quine aside for the moment, we must ask ourselves 
what the role of the philosophy of mathematics should be. Mathe- 
matics, after all, is an immensely successful enterprise in its own 
right, older, in fact, than experimental natural science. As such, it 
surely deserves a philosophical effort to understand it as practiced, 
as a going concern. Indeed, as in any discipline, there remain con- 
ceptual confusions in mathematics that might be clarified by philo- 
sophical analysis, providing that analysis is sensitive to the realities of 
actual mathematics. If it is to serve these purposes, a philosophical 
account of mathematics must not disregard the evidential relations 
of practice or recommend reforms on nonmathematical grounds. 

These are, in my view, proper goals for the philosophy of mathe- 
matics. We, as philosophers of mathematics, should provide an ac- 
count of mathematics as practiced, and we should make a contribu- 
tion to unraveling the conceptual confusions of contemporary math- 
ematics. So it is against this backdrop that I shall assess the 
indispensability arguments, or rather, the view of mathematics the 
indispensability arguments generate. They will be judged by their 
ability to account for actual mathematics as practiced. 


1 Quine, “Five Milestones of Empiricism,” in Theories and Things (Cam 
bridge: Harvard, 1981), pp. 67-72, here p 72 
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U INDISPENSABILITY 
The original indispensability arguments were aimed at those who 
would draw a weighty ontological or epistemological distinction be- 
tween natural science and mathematics. To those tempted to admit 
the existence of electrons while denying the existence of numbers, 


Quine’ points out that 


Ordinary interpreted scientufic discourse is as irredeemably committed 
to abstract objects—to nations, species, numbers, functions, sets—as it 
is to apples and other bodies. All these things figure as values of the 
variables in our overall system of the world. The numbers and func- 
tions contribute just as genuinely to physical theory as do hypothetical 
particles (tbid., pp. 149-50). 


But, Quine’s opponent insists, the scientific hypotheses in our 
theory are tested by experiment, and the mathematical ones are not; 
surely the two can be distinguished on these grounds. To which 
Quine replies, 

The situation may seem to be saved, for ordinary hypotheses in natural 

science, by there being some indirect but eventual confrontation with 

empirical data. However, this confrontation can be remote; and, con- 
versely, some such remote confrontation with experience may be 
claimed even for pure mathematics and elementary logic. The sem- 
blance of a difference in this respect is largely due to overemphasis of 
departmental boundaries. For a self-contained theory which we can 
check with experience includes, in point of fact, not only its varous 
theoretical hypotheses of so-called natural science but also such por- 
tions of logic and mathematics as it makes use of.® 

Thus, the applied mathematics is confirmed-along with the physical 

theory in which it figures. 

Of course, it is not enough for a piece of mathematics simply to 
appear in a confirmed scientific theory. For any theory T, there is 
another theory T” just like T except that T’ posits a bunch of new 
particles designed to have no affect on the phenomena T predicts. 
Any experiment confirming T under these circumstances would also 
(in some sense) confirm T’, but we do not take this as evidence for 
the existence of the new particles because they are ‘‘dispensable,”’ 
i.e., there is an equally good, indeed better theory of the same phe- 
nomena, namely, T, that does not postulate them. The mathematical 
apparatus of modern physics does not seem to be dispensable in this 
way; indeed, Putnam has emphasized that many physical hypotheses 


5 “Success and Limits of Mathematization,” in Theories and Things, pp. 
148-55. 


6 “Carnap and Logical Truth,” p. 121. 
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cannot even been stated without reference to numbers, func- 
tions, etc.” 

So a simple indispensability argument for the existence of mathe- 
matical entities goes like this: we have good reason to believe our 
best scientific theories, and mathematical entities are indispensable 
to those theories, so we have good reason to believe in mathematical 
entities. Mathematics is thus on an ontological par with natural 
science. Furthermore, the evidence that confirms scientific theories 
also confirms the required mathematics, so mathematics and natural 
science are on an epistemological par as well. 

Unfortunately, there is a prima facie difficulty reconciling this 
view of mathematics with mathematical practice.* We are told we 
have good reason to believe in mathematical entities because they 
play an indispensable role in physical science, but what about mathe- 
matical entities that do not, at least to date, figure in applications? 
Some of these are admissible, Quine’ tells us, 


... insofar as they come of a simplificatory rounding out, but anything 
further is on a par rather with uninterpreted systems (1b1d., p. 788) 


So in particular, 
I recognize indenumerable infinites only because they are forced on 
me by the simplest known systematizations of more welcome matters. 


Magnitudes in excess of such demands, e.g. 3, or inaccessible numbers, 
I look upon only as mathematical recreation and without ontological 


rights.?° 
The support of the simple indispensability argument extends to 
mathematical entities actually employed in science, and only a bit 
beyond. 

The trouble is that this does not square with the actual mathemati- 
cal attitude toward unapplied mathematics. Set theorists appeal to 
various sorts of nondemonstrative arguments in support of their 
customary axioms, and these logically imply the existence of 3,. Inac- 


7 Hartry Field disputes this in his Science without Numbers (Princeton: Univer- 
mty Press, 1980), and Realism, Mathematics and Modality (Cambridge: Black- 
well, 1989), but the copious secondary literature remains unconvinced. 

* Vermons of this concern appear in C. Chihara, Constructibiltty and Maths- 
matical Existence (New York: Oxford, 1990), p. 15; and in my Realism in Mathe- 
matics (New York: Oxford, 1990), pp. 30-1. 

° “Review of Charles Parsons’s Mathematics in Philosophy,” this JOURNAL, 
D, 12 (December 1984): 783-94 

“Reply to Charles Parsons,” in The Philosophy of W. V. Quine, L. 
Hahn and P. Schilpp, eds. (La Salle, IL: Open Court, 1986), pp. 396—4053; here p. 
400. 
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cessibles are not guaranteed by the axioms, but evidence is cited on 
their behalf nevertheless. If mathematics is understood purely on 
the basis of the simple indispensability argument, these mathemati- 
_cal evidential methods no longer count as legitimate supports; what 
matters is applicability alone. Here simple indispensability theory 
rejects accepted mathematical practices on nonmathematical 
grounds, thus ruling itself out as the desired philosophical account 
of mathematics as practiced. 

So simple indispensability will not do, if we are to remain faithful 
to mathematical practice. We insist on remaining faithful to mathe- 
matical practice because we earlier endorsed a brand of naturalism 
that includes mathematics. But it is worth noting that even a retreat 
to purely nonmathematical naturalism (forgetting our commitment 
to actual mathematical practice) will not entirely solve this problem. 
From the point of view of science-only naturalism, the applied part 
of mathematics is admitted as a part of science, as a legitimate plank 
in Neurath’s boat; unapplied mathematics is ignored as unscientific. 
But even for applied mathematics there is a clash with practice. 
Mathematicians believe the theorems of number theory and analysis 
not to the extent that they are useful in applications but insofar as 
they are provable from the appropriate axioms. To support the 
adoption of these axioms, number theorists and analysts may appeal 
to mathematical intuition, or the elegant systematization of mathe- 
matical practice, or other intramathematical considerations, but 
they are unlikely to cite successful applications. So the trouble is not 
just that the simple indispensability argument shortchanges unap- 
plied mathematics; it also misrepresents the methodological realities 
of the mathematics that is applied. 

There is, however, a modified approach to indispensability consid- 
erations which gets around this difficulty. So far, on the simple 
approach, we have been assuming that the indispensability of (some) 
mathematical entities in well-confirmed natural science provides 
both the justification for admitting those mathematical things into 
our ontology and the proper methodology for their investigation. 
But perhaps these two—ontological justification and proper 
method—can be separated. We could argue, first, on the purely 
ontological front, that the successful application of mathematics 
gives us good reason to believe that there are mathematical things. 
Then, given that mathematical things exist, we ask: By what methods 
can we best determine precisely what mathematical things there are 
and what properties these things enjoy? To this, our experience to 
date resoundingly answers: by mathematical methods, the very 
methods mathematicians use; these methods have effectively pro- 
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duced all of mathematics, including the part so far applied in physi- 


- cal science. 


From this point of view, a modified indispensability argument first 
guarantees that mathematics has a proper ontology, then endorses 
(in a tentative, naturalistic spirit) its actual methods for investigating 
that ontology. For example, the calculus is indispensable in physics; 
the set-theoretic continuum provides our best account of the cal- 
culus; indispensability thus justifies our belief in the set-theoretic 
continuum, and so, in the set-theoretic methods that generate it; 
examined and extended in mathematically justifiable ways, this 
yields Zermelo-Fraenkel set theory. Given its power, this modified 
indispensability theory of mathematics stands a good chance of 
squaring with practice, so it will be preferred in what follows.!! 

OI. THE SCIENTIFIC PRACTICE OBJECTION 

My first reservation about indispensability theory stems from some 
fairly commonplace observations about the practice of natural 
science, especially physics. The indispensability argument speaks of 
a scientific theory T, well-confirmed by appropriate means and 
seamless, all parts on an ontological and epistemic par. This seam- 
lessness is essential to guaranteeing that empirical confirmation ap- 
plies to the mathematics as well as the physics, or better, to the 
mathematized physics as well as the unmathematized physics. 
Quine’s!? vivid phrases are well-known: “our statements about the 
external world face the tribunal of sense experience not individually 
but only as a corporate body” (#bid., p. 41). 

Logically speaking, this holistic doctrine is unassailable, but the 
actual practice of science presents a very different picture. Histori- 
cally, we find a wide range of attitudes toward the components of 
well-confirmed theories, from belief to grudging tolerance to out- 
right rejection. For example, though atomic theory was well-con- 
firmed by almost any philosopher’s standard as early as 1860, some 
scientists remained skeptical until the turn of the century—when 
certain ingenious experiments provided so-called “direct verifica- 
tion” —and even the supporters of atoms felt this early skepticism to 
be scientifically justified.'* This is not to say that the skeptics neces- 


l? This is more or less the position of my Realism in Mathematics. 

12? Two Dogmas of Empiricism,” repr. in From a Logical Potnt of View, pp. 
20-46 

13I trace the history in some detail m ‘Taking Naturalism Seriously,” in Pro- 
ceedings of the 9th International Congress of Logic, Methodology and Philo- 
sophy of Sctence, D. Prawitz, B. Skyrms, and D Westerstahl, eds. (Amsterdam: 
North Holland, forthcoming). 
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sarily recommended the removal of atoms from, say, chemical 
theory; they did, however, hold that only the directly verifiable con- 
sequences of atomic theory should be believed, whatever the explan- 
atory power or the fruitfulness or the systematic advantages of think- 
ing in terms of atoms. In other words, the confirmation provided by 
experimental success extended only so far into the atomic-based 
chemical theory T, not to the point of confirming its statements 
about the existence of atoms. This episode provides no comfort to 
the van Fraassenite, because the existence of atoms was eventually 
established, but it does show scientists requiring more of a theory 
than the sort of theoretical virtues typically discussed by philo- 
sophers. 

Some philosophers might be tempted to discount this behavior of 
actual scientists on the grounds that experimental confirmation is 
enough, but such a move is not open to the naturalist. If we remain 
true to our naturalistic principles, we must allow a distinction to be 
drawn between parts of a theory that are true and parts that are 
merely useful. We must even allow that the merely useful parts 
might in fact be indispensable, in the sense that no equally good 
theory of the same phenomena does without them. Granting all this, 
the indispensability of mathematics in well-confirmed scientific the- 
ories no longer serves to establish its truth. 

But perhaps a closer look at particular theories will reveal that the 
actual role of the mathematics we care about always falls within the 
true elements rather than the merely useful elements; perhaps the 
indispensability arguments can be revived in this way. Alas, a glance 
at any freshman physics text will disappoint this notion. Its pages are 
littered with applications of mathematics that are expressly under- 
stood not to be literally true: e.g., the analysis of water waves by 
assuming the water to be infinitely deep or the treatment of matter 
as continuous in fluid dynamics or the representation of energy as a 
continuously varying quantity. Notice that this merely useful mathe- 
matics is still indispensable; without these (false) assumptions, the 
theory becomes unworkable. 

It might be objected that these applications are peripheral, that 
they are understood against the background of more fundamental 
theories, and that it is in contrast with these that the applications 
mentioned above are ‘‘idealizations,” “models,” “approximations,” 
or useful falsehoods. For example, general relativity is a fundamen- 
tal theory, and when space-time is described as continuous therein, 
this is not explicitly regarded as less than literally true. So the argu- 
ment goes. 
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But notice, when those pre-Finsteinians were skeptical of atomic 
theory, it was a fundamental theory in this sense; it was not pro- 
posed against another background as a convenient idealization or 
mere approximation. The skeptics were bothered, not by such pe- 
ripheral simplifications, but by what they saw as the impossibility of 
directly testing the core hypotheses of atomic theory. But consider 
now the hypothesis that space-time is continuous. Has this been 
directly tested? As Quine himself points out, “no measurement 
could be too accurate to be accommodated by a rational number, 
but we admit the [irrationals] to simplify our computations and gen- 
eralizations.”!* Similarly, space-time must be regarded as continu- 
ous so that the highly efficacious continuum mathematics can be 
applied to it. But the key question is this: Is that continuous charac- 
ter ‘‘experimentally verified” or merely useful? Hf it is merely useful, 
then the indispensability argument sketched earlier, the one relying 
on the role of continuum mathematics in science to support the 
Zermelo-Fraenkel axioms, or ZFC, cannot be considered conclusive. 

I shall not try to answer this question here; to do so would require 
a more thorough study of the physics literature than I am capable of 
launching just now. But until such a study is undertaken, until the 
evidence for the literal continuity of space-time is critically exam- 
ined,’ I think the simple observations collected here are enough to 
raise a serious question about the efficacy of this particular indis- 
pensability argument. 

IV. FOUNDATIONS OF SET THEORY 
My second reservation about the indispensability arguments rests on 
somewhat less familiar grounds; to reach it, I must review a bit of set 
theory and (in the next section) a bit of physics. 

It is well-known that the standard axioms of contemporary set 
theory, ZFC, are not enough to decide every naturally-arising set 
theoretical question.!*® The most famous independent statement is 
the Cantor’s continuum hypothesis (CH), but there are others, some 
of them more down-to-earth than CH. For example, between the 
mid-seventeenth and the late nineteenth century, under pressure 
from both physical and mathematical problems, the notion of func- 


4 “Reply to Charles Parsons,” 

Coase cE the a St N eer ene 
sect. V. 

18 Kurt G&del’s incompleteness theorem is enough to establish that there are 
set-theoretic statements that can neither be proved nor disproved from ZFC, but 
Gddel’s later work on the inner model of constructible sets and Paul Coben’s 
forcing methods yield more: there are statements which mathematicians have 
found it natural to ask which are likewise independent. 
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tion became more and more general. Around the beginning of our 
century, various mathematicians undertook to bring order to the 
wild domain of discontinuous functions. It soon became clear that 
the complexity of functions could be understood in terms of the 
complexity of sets of real numbers—e.g., a function is continuous 
iff the inverse image of every open set is open—and this naturally 
led to a serious study of the properties of definable sets of reals. 
Among these, the Borel sets could easily be shown, for example, to 
be Lebesgue measurable. This result generalized to projections!” of 
Borel sets (the analytic sets) and to the complements of these (the 
coanalytic sets). One more application of projection produces the 
sets we now call Z}, but the question of their Lebesgue measurabi- 
lity remained stubbornly unsolved. Sometime later, this question 
was shown to be independent of ZFC."® 

In contrast with CH, this question concerns only a limited class of 
definable sets of reals, sets whose definitions have concrete geomet- 
Tic interpretations, and it involves the intuitive analytic notion of 
Lebesgue measurability rather than Cantor’s bold new invention, 
the comparison of infinite cardinalities. In other words, it might be 
said that this independent question, unlike CH, arose in the 
straightforward pursuit of analysis-as-usual. And there are others of 
this type. 

There is a serious foundational debate about the status of these 
statements. Despite their independence of ZFC, one might hold that 
there is nevertheless a fact of the matter, that the statements are 
nevertheless either true or false, and that it is the burden of further 
theorizing to determine which.’ At the other extreme, another 
might insist that ZFC is all there is to set theory, that a statement 
independent of these axioms has no inherent truth value, that the 
study of extensions of ZFC that settle these questions one way or the 
other are all equally legitimate. For future reference, let me attach 
labels to crude versions of these positions: let fact be the bare as- 
sumption that there is a determinate answer to our question, and let 
the opposing view be no-fact. 

Now let us pose this foundational question to the indispensability 
theorist, taking the simple version of indispensability first, for pur- 


17 The projection of a subset of the plane is its shadow on one of the coordinate 


axes. 
'* I discuss this history in more detail in ch. 4 of Realism ın Mathematics and 
in “Taking Naturalism Seriously.” 
19 This was Gödel’s view, and the one defended in Realism in Mathematics. 
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poses of comparison.” Continuum mathematics, including every- 
thing from real-valued measurement to the higher calculus, is 
among the most widely applied branches of mathematics, and at 
least some of the many physical theories in which it is applied are 
extremely well-confirmed. Therefore, so the argument goes, we 
have good reason to believe in the entities of continuum mathemat- 
ics, for example, the real numbers. In these applications—for exam- 
ple, in the theory of space (or space-time)—we also find quantifica- 
tion over sets of reals, (or equivalently, over regions of space (or 
space-time)), though particular instances are rarely as complex as 
23.7! If we believe in the reals and in those sets of reals definable in 
our theory, then it seems we should accept the legitimacy of the 
question: Are 2} sets Lebesgue measurable?” Thus the simple indis- 
pensability theorist should endorse fact. 

What follows from this for the practice of set theory? Should the 
set theorist, as Gödel suggests, seek an answer to this legitimate 
question? Given that our independent question seems, for now, to 
have no bearing on physical theory, and that it is not settled by the 
most generous of “simplificatory rounding outs” (i.e., ZFC), the 
simple indispensability theorist, who uses only the justificatory 
methods of physical science, has no means for answering it. Further- 
more, the exclusive focus on the needs and methods of physical 
science hints at a lack of interest in any question without physical 
ramifications. If so, the simple theorist may disagree with no-fact, 
classifying our independent question as one with an unambiguous 
truth value, without going so far as charging future theorists with 
the task of answering it. Call this weak fact. 

Fortunately, these subtle matters of interpretation are beside the 
point here, because we have already identified the modified indis- 
pensability argument as more promising than the simple. Like the 
simple theorist, the modified indispensability theorist embraces the 
ontology of continuum mathematics, on the basis of its successful 
applicability, and thus the legitimacy of our independent question, 
but she goes beyond the simple theorist by ratifying the set theorist’s 


* For now, I shall ignore the scientific practice objection and take fundamental 
scientific theory at face value. 

*' One theoretical proposal that involves nonmeasurable sets of reals is I Pi- 
towaky, ‘‘Determunistic Model of Spin and Statistics,” Physical Review D, XXVI, 
10 (15 May 1988). 2816-26 (M. van Lambalgen called this paper to my atten- 
tion.) 

2 This might be avoided if we took mathematical entities to be somehow “‘in- 
complete,” an idea of Parsons’s which Quine considers in passing See Quine’s 
“Reply to Charles Parsons,” p. 401, and references cited there 
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search for new axioms to answer the question. Call this strong fact. 
Although the evidence for or against these axiom candidates will 
derive, not from physical applications, but from considerations in- 
ternal to mathematics, the modified theorist sees the past success of 
such mathematical methods as justifying their contin- 
ued use. 

Either way, then, the indispensability theorist should adopt some 
version of fact. Notice, however, that this acceptance of the legiti- 
macy of our independent question and (for the modified theorist) 
the legitimacy of its pursuit is not unconditional; it depends on the 
empirical facts of current science. The resulting mathematical be- 


liefs are likewise a posteriori and fallible. 
V. FOUNDATIONS OF PHYSICS 


Here the fundamental theories are general relativity and quantum 
mechanics, and the central problem is to reconcile the two. Three of 
the fundamental forces—electromagnetic, weak, and strong—have 
yielded more or less workable quantum-field descriptions, but grav- 
ity remains intractable. The mathematics of quantum-field theories 
has an annoying habit of generating impossible (infinite) values for 
some physical magnitudes, but the problem has been overcome by a 
technical trick called “renormalization” in all cases but that of grav- 
ity. So the problem remains: How to characterize the gravitational 
force at quantum distances? 

Physicists engage in fascinating speculations on the source of the 
difficulty: for example, that it arises from the attempt to combine 
the essentially passive space-time of quantum theory with the dy- 
namic space-time of general relativity.” But most important for our 
purposes is the idea that the fault might lie in our conception of 
space-time as a mathematical continuum. For example, Richard 


Feynman™ writes: 
I believe that the theory that space is continuous is wrong, because we 
get these infinities and other difficulties . . . (2b1d., p. 166). 


And Chris Isham: 


... it is clear that quantum gravity, with its natural Planck length, raises 
the possibility that the continuum nature of spacetime may not hold 
below this length, and that a quite different model is needed (op. cit., 
p. 72). 


™ See Chris Isham, “Quantum ae in The New Physics, P. Davies, ed. 
(New York: Cambridge, 1989), pp. 70-93, esp. p. 70 I review some of the popu- 
lar literature on this problem in ‘Taking Naturalism Seriously.” 

™ The Character of Physical Law (Cambridge: MIT, 1967). 
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Here the suggestion is not (as in the previous section) that the conti- 
nuity of space-time is a “mere idealization,” but that it does not 
belong in our best theory at alll 

All this, as I have indicated, is quite speculative; no one yet knows 
what a reasonable theory of quantum gravity might be like. But the 
very suggestion that space-time may not be continuous is enough to 
add previously unimagined poignancy to our earlier conclusion of a 
posteriority and fallibility. What seemed a rather small concession at 
the end of the last section—that the grounds for the indispensability 
theorist’s adherence to fact could be overthrown by progress in 
physics—now looms as a real possibility. 

Of course, the (potential) falsity of continuum mathematics in its 
application to space-time would be only part of the story; there are 
other successful scientific uses for the calculus and higher analysis. 
But, if science were to change so that all fundamental theories were 
thoroughly quantized, so that no continuum mathematics appeared 
there, if all the remaining applications of continuum mathematics 
were explicitly understood as “approximations” or “‘idealizations”’ 
or ‘‘models,” then even the modified indispensability theorist would 
retreat to some version of no-fact. The case of quantum gravity 
should keep us from dismissing this possibility out of hand. 

VI. THE MATHEMATICAL PRACTICE OBJECTION 

For the modified indispensability theorist, the choice between 
strong fact and no-fact hinges on developments in physics (and per- 
haps the rest of science). We should now ask the impact of this 
choice: How would the pursuit of our independent question be af- 
fected by it? In other words, we want to know if the metaphysical 
distinction between strong fact and no-fact has methodological con- 
sequences. 

If no-fact is correct, if there is no pre-existing fact to discover 
about the Lebesgue measurability of 2} sets of reals, then what 
approach should the set theorist take? Many observers would hold 
that no-fact is the end of the story, that mathematicians are in the 
business of discovering truths about mathematical reality, and that, 
if there is no truth to be found, the mathematician should reject the 
question. From this point of view, all (relatively consistent) set the- 
ories extending ZFC are equally legitimate, there is no call to chose 
between them, and indeed, no grounds on which to do so apart 
from subjective aesthetic preferences. Once our question is shown 
to be independent, and developments in science undercut the claim 
to inherent truth value, there is nothing more of serious import to 
be said about the Lebesgue measurability of Z sets. Call this end of 
the story no-fact. 
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This position makes nonsense of the contemporary search for new 
set-theoretic axioms to settle independent questions like ours. In- 
deed, in our case, there are two competing candidates: V = L 
(Gédel’s “axiom of constructibility’”) and MC (the existence of a 
measurable cardinal). If V = L, then there is a non-Lebesgue mea- 
surable Z} set; if MC, then all Z} sets are Lebesgue measurable. Set 
theorists offer arguments for and against these axiom candidates,” 
and in this debate, MC is strongly favored over V = L.?® If we are 
not to reject this activity as inconsequential mutterings—an espe- 
cially unappealing move, given that the original axioms of ZFC are 
supported by arguments of a similar flavor—we must instead reject 
end of the story no-fact. 

But there is another version of no-fact. Even if there is no pre-ex- 
isting fact of the matter to be discovered, the process of extending 
the axioms of set theory might well be governed by nonarbitrary 
principles. This idea turns up, not only in the study of set theory, 
but when ontological decisions are made in other branches of mathe- 
matics as well. For example, Kenneth Manders?” describes the theo- 
retical norms at work in the expansion of the domain of numbers to 
include the imaginary or complex numbers, and Mark Wilson™ un- 
covers the rationale behind the move from affine to projective geom- 
etry. In such cases, despite lip service to the notion that any consis- 
tent system is as good as any other, mathematicians actually insist 
that a given mathematical phenomenon is correctly viewed in a cer- 
tain (ontological) setting, that another setting is incorrect. 

One need not assume fact to endorse these practices. Even if 
there is no fact of the matter, no pre-existing truth about the exis- 
tence or nonexistence of complex numbers or geometric points at 
infinity or nonconstructible sets, the pursuit of these mathematical 
topics might be constrained by mathematical canons of ‘‘correct- 

ess.” For our case, one might hold that there is no fact of the 
matter about the Lebesgue measurability of 2} sets, but that there 
are still good mathematical reasons to prefer extending ZFC in one 
way rather than another, and perhaps, good mathematical reasons 
to adopt a theory that decides our question one way rather than 
another. From this point of view, no-fact is just the beginning of the 


‘Believing the Axioms I-II,” The Journal of Symbolic Logic, LI, 2 
Gune 1988): 481,517 481-511, and 3 (September 1988): 736-64. 
7 I discuss part of the case against V = L in “Does V equal L?” in The Journal 


of Symbolic Logic (forthcoming). 
“Domain Extension and the Philosophy of Mathematics,” this JOURNAL, 
LXXXVI, 10 (October 1989): 553—62. 
™ “Frege: The Royal Road from Geometry,” in Nowis (forthcoming) 
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story; it opens the door on the fascinating study of purely mathemat- 
ical canons of correctness. Call this beginning of the story no-fact. 

The question before us is this: What are the methodological con- 
sequences of the choice between strong fact and no-fact? If strong 
fact is correct, the set theorist in search of a complete theory of her 
subject matter should seek out additional true axioms to settle the 
Lebesgue measurability of Z} sets (and so on). If end of the story 
no-fact is correct, the set theorist left with any interest in the matter 
should feel free to adopt any (relatively consistent) extension of ZFC 
she chooses, or even to move back and forth between several mutu- 
ally contradictory such extensions at will. And finally, if beginning 
of the story no-fact is correct, the set theorist should use appropriate 
canons of mathematical correctness to extend ZFC and to decide 
the question.” 

Obviously, the method prescribed by strong fact differs from that 
prescribed by end of the story no-fact; that much is easy. But what 
about strong fact and beginning of the story no-fact? Does the pur- 
suit of truth differ from the pursuit of mathematical correctness? In 
fact, I think it does. Consider, for example, a simple argument that 
V = Lshould be rejected because it is restrictive. A supporter of this 
argument owes us an explanation of why restrictive theories are bad. 
A beginning of the story no-fact-er might say, “because the point of 
set theory is to realize as many isomorphism types as possible, and 
set theory with MC is richer in this way.’ A strong fact-er might 
agree that the world of MC has desirable properties, while insisting 
that desirability (notoriously!) is no guarantee of truth.” Faced with | 
the beginning of the story no-fact-er’s argument, a strong fact-er 
would reply, ‘‘Yes, MC is nice in the way you indicate, but if V does 


™ The mathematical canons mvoked in beginning of the 10 Poput could 
ultimately recommend that several different set theories be acco equal status 
The methodology at work would still be different from that of end of the story 
no-fact, and the range of theories endorsed would almost certainly be narrower. 

Spelling out this line of thought precisely is no simple exercise, but I shall 

leave that problem aside here. The point is just that the beginning of the story 
Gel aS arin belie 

3! Philip Kitcher touches on this point in his reply to Manders, ‘Innovation and 
Understanding in Mathematics,” this JOURNAL, LXXXVI, 10 (October 1989): 563— 
4, when he wnites, p. 564° “Suppose this is a way in which mathematical knowl- 
edge can grow. What kinds of views of mathematical reality and mathematical 
progress are open to us? Can we assume that mvoking entives that satisfy con- 
straints we favor is a legitimate strategy of recognizing hitherto negiected objects 
that exist independently of us? From a realist perspective, the method of postulat- 
ing what we want has (in Bertrand Russell’s famous phrase) ‘the advantages of 
theft over honest toil.’ Lf that method is, as Richard Dedekind supposed, part of 
the honest trade of mathematics, is something wrong with the realist perspec- 
tive?” 
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equal L, L does contain all the isomorphism types possible. What’s 
needed is an argument that V = L is false.” So strong fact will differ 
methodologically from beginning of the story no-fact as well as end 
of the story no-fact. 

We have reached this point: a methodological decision in set 
theory—namely, that between the methodologies proper to strong 
fact and to beginning of the story no-fact—hinges on developments 
in physics. If this is correct, set theorists should be eagerly awaiting 
the outcome of debate over quantum gravity, preparing to tailor the 
practice of set theory to the nature of the resulting applications of 
continuum mathematics. But this is not the case; set theorists do not 
regularly keep an eye on developments in fundamental physics. Fur- 
thermore, I doubt that the set-theoretic investigation of indepen- 
dent questions would be much affected even if quantum gravity did 
end up requiring a new and different account of space-time; set 
theorists would still want to settle open questions about the mathe- 
matical continuum. Finally, despite the current assumed indispensa- 
bility of continuum mathematics, I suspect that the actual approach 
to the Lebesgue measurability of 2} sets, to V = L versus MC, is 
more like that prescribed by beginning of the story no-fact than that 
prescribed by strong fact,®* and I see no mathematical reason to 
criticize this practice. In short, legitimate choice of method in the 
foundations of set theory does not seem to depend on physical facts 
in the way indispensability theory requires. 

VII CONCLUSION 
I have raised two doubts about indispensability theory, even modi- 
fied indispensability theory, as an account of mathematics as prac- 
ticed. The first, the scientific practice objection, notes that indispen- 
sability for scientific theorizing does not always imply truth and calls 
for a careful assessment of the extent to which even fundamental 
mathematized science is “idealized” (i.e., literally false). The second, 
the mathematical practice objection, suggests that indispensability 
theory cannot account for mathematics as it is actually done. If these 
objections can be sustained, we must conclude that the indispensa- 
bility arguments do not provide a satisfactory approach to the ontol- 
ogy or the epistemology of mathematics. Given the prominence of 
indispensability considerations in current discussions, this would 
amount to a significant reorientation in contemporary philosophy of 
mathematics. 
PENELOPE MADDY 

University of California/Irvine 


* I hope to argue this in some detail elsewhere. 
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THE SKELETON IN FREGE’S CUPBOARD: THE STANDARD 
VERSUS NONSTANDARD DISTINCTION 


nyone who uses higher-order logic faces a momentous 

choice concerning the interpretation of higher-order quan- 

tifiers. Let us consider as an example a second-order quan- 
tifier that involves a one-place class or predicate variable, say, X. Its 
values can be taken to be either classes of individuals or properties 
(concepts) of individuals. In either case, the same dilemma con- 
fronts a higher-order logician, even if its horns look different on the 
two interpretations. 

If the values of X are thought of as being classes, then the crucial 
question is whether the range of a quantifier containing X is the 
entire power set P(do(M)) of the relevant domain do(M) of individ- 
uals, or only some designated subset of P(do(M)). In other words, 
the question is whether the values of X are arbitrary extensionally 
possible classes or whether only some such classes are accepted as 
values of X. The former alternative results in what is usually called 
the standard interpretation of higher-order logic, the latter in a 
nonstandard interpretation. 

A nonstandard interpretation can be of many different kinds. The 
relevant range of X is sometimes thought of as a part of the specifica- 
tion of the model M in which a higher-order formula is being inter- 
preted, sometimes as a restraint on the mterpretation. Often, the 
range of X and the ranges of other higher-order variables cannot be 
selected completely arbitrary, but are subject to certain closure con- 
ditions. For instance, the totality of these different ranges is as- 
sumed to be closed with respect to Boolean operations and to pro- 
jective ones; in other words, the usual formation rules are assumed 
to preserve interpretability.’ 

The same distinction has to be made for variables of each logical 
type. An especially important case in point is the interpretation of 
function variables, as witnessed by the fact that a specific nonstan- 
dard interpretation is sometimes imposed on them on purpose. The 


1 Leon Henkin, “Com eteness in the Theory of Types,” The Journal of Sym- 
bolic Logic, xv (1950): 81-91. For an important correction to Henkin’ r pae 
see Peter B. Andrews, ‘General Models and Extensionality,” The Jou of 
Symbolic Logic, XXXVO (1972): 395-7. 
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best known essay in this direction is probably Kurt Gédel’s? func- 
tional interpretation of first-order logic and arithmetic. There the 
values of function variables are restricted to recursive functions of 
the appropriate type. Notwithstanding the variety of different types 
of entities and their intricate interrelations, the apparently special 
case of one single second-order quantifier with a one-place predi- 
cate or class variable is fully representative of the entire theory of 
finite types. For there are results that show that, if the standard 
interpretation is granted to one single second-order one-place 
quantifier, the entire higher-order logic (theory of finite types) with 
the standard interpretation can be reconstructed by its means.® Of 
course, by the same token the interpretation of a single-function 
variable determines in a sense the interpretation of the entire 
higher-order logic, for classes can always be handled by means of 
their characteristic functions. 

Philosophers have not paid adequate attention to the standard 
versus nonstandard distinction. The main reason is undoubtedly 
their slowness to appreciate the importance of the model-theoretical 
viewpoint in logical theory.* Be this as it may, the contrast was 
spelled out explicitly by Leon Henkin in 1950. The fact that he 
considered only one nonstandard interpretation does not detract 
from his achievement. 

I. CONCEPTS VERSUS THEIR EXTENSIONS 
The interpretational dilemma cannot be avoided by switching to 
variables that range over properties or relations in contradistinction 
to their extensions. Then the question whether the standard inter- 
pretation is assumed becomes the question as to whether for each 
class (potential extension), C, there is a property that has C as its 
extension or, in Gottlob Frege’s term, as its course-of-values. 

This distinction is extended as a matter of course to variables of 
other higher-order types. As a bargain, we are put into the position 
of being able to distinguish the main problem addressed in this 
paper from two others. In this paper, we are asking whether Frege 
assumed that for each potential extension there is a concept that has 
this extension as its value range. One can also ask the mirror-image 
question: Is there an extension for each concept? It turns out to be 


? “On a Hitherto Unexploited Extension of the Finitary Standpomt,” Journal 
of Philosophical Logic, 1x (1980): 133-42 

* For such results, see Hintikka, “Reductions in the Theory of Types,” Acta 
Philosophica Fennica, v (1955): 59-115 

* This is documented in Hintikka, “On the Development of the Model-Theo- 
retic Viewpoint in Logical Theory,” Synthese, Lxxvo (1988): 1-36 
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extremely important to separate the two questions from each other, 
both in discussing Frege and in discussing other logicians and mathe- 
maticians (cf. below). 

We can also formulate a third question: Are two concepts with the 
same extension identical? This question is often combined with the 
first two and formulated as the query: Is logic extensional? It turns 
out, however, that often the philosophers who ask this question 
really have in mind only one of the three questions. It is also impor- 
tant to realize that an ‘‘extensionalist’’ answer to the question con- 
cerning the identity conditions of functions (including concepts), 
which is Frege’s answer, by no means prejudices his answer to either 
of the other two questions. 


II THE STANDARD VERSUS NONSTANDARD DISTINCTION 
AND THE EXPRESSIBILITY OF CONCEPTS IN LANGUAGE 


Sometimes, a nonstandard interpretation is guided by the idea that 
only such properties, relations, and functions can be assumed to 
exist as can be defined or otherwise captured by a suitable expres- 
sion of one’s language. In the case of theories with infinite models, 
this leads inevitably to a nonstandard interpretation, for there can 
be only a countable number of such definitions or characterizations 
available for this purpose. Hence they cannot capture all the subsets 
of do(M), for there is an uncountable number of them. 

This particular nonstandard interpretation was the only one con- 
sidered by Henkin. More generally, it seems to be the only one that 
is considered in most of the literature where the Henkin distinction 
is relied on or discussed. It is nevertheless important to realize that 
other nonstandard interpretations are possible. What is crucial is to 
realize that the nonstandard versus standard distinction is not the 
same as the distinction between, on the one hand, variables each of 
whose values always has a representative in language which can serve 
as a substitution value for the variable and, on the other, variables 
whose values can exist independently of their linguistic representa- 
tions. The latter issue can be called the problem of nonlinguistic 
existence (of values of higher-order variables). Historically, it may 
have been the source of the standard versus nonstandard distinc- 
tion, but, systematically, it is independent of this distinction. In gen- 
eral, representability in a language on the basis of a theory depends 
on the language and on the theory, while the choice between the 
standard and a nonstandard interpretation faces us in using any 
higher-order language. 

A confusion between the different questions we have just distin- 
guished is not merely an abstract possibility. The fallacy of not sepa- 
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rating them has been committed even by knowledgeable philo- 
sophers. For instance, in an article in Erkenninis, Rudolf Carnap” 
criticizes Frank Ramsey’s reliance on the standard interpretation 
and writes: 
I think we should not let ourselves be seduced by it into accepting 
Ramsey's basic premise, viz., that the totalities of properties already 
exist before their characterization by definition. . . 
I think we ought to hold fast to Frege’s dictum that in mathematics 
only that may be taken to exist whose existence has been proved (and 
he meant proved in finitely many steps) (zbid., p. 102). 


Here Carnap is clearly misinterpreting Ramsey’s position. In so far 
as we are dealing with what Ramsey called predicates and relations 
“in the ordinary sense,” Ramsey apparently accepts the idea that 
there has to be an expression for them. In this sense, Ramsey in fact 
discusses how functions are constructed by constructing expressions 
for them. Ramsey’s point is that our entire logic has to be reinter- 
preted so as to deal, “not with relations in ordinary sense, but with 
possible correlations, or ‘relations in extension’ as Mr. Russell calls 
them,” and likewise for entities of other logical types. In fact, Car- 
nap seems to misinterpret Frege’s position, too, or at least he gives a 
wrong description of it, as we shall see. 

A terminological point is in order here. By calling one of the two 
contrasting interpretations “‘standard,” we are not passing any judg- 
ment as to which view ought to be adopted or which one is histori- 
cally the usual one. We are merely following the terminology of 
Henkin, who was the first logician to articulate fully the standard 


versus nonstandard distinction. 
Ill. THE STANDARD VERSUS NONSTANDARD DISTINCTION 
AND SET THEORY 


On the basis of what has been said, it might seem that the standard 
interpretation has set theory as its main fortress. In the first section 
of this paper, we explained what the standard interpretation is by 
means of the familiar set-theoretical notion of the power set. It is 
undoubtly true that Georg Cantor intuitively assumed some sort of 
standard interpretation in his thinking. It can even be argued, as 
Michael Hallett® has aptly done, that the actual generated principles 
for transfinite ordinals go “most of the way toward acceptance of 
the existence of arbitrary sub-domains” and therefore toward the 
standard interpretation. As Hallett writes: 

5 “Die logizistische Grundlegung der Mathemank,” Erkenntnis, 0 (1981): 91- 


105. 
* Cantonan Set Theory and Limitation of Size (New York: Oxford, 1990). 
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Cantor asserts that the two principles of generation [he had formu- 
lated] ‘give us the ability to break through every barrier in the forma- 
tion of real, whole numbers’ (the italics are Cantor’s). This certainly 
suggests a belief that these methods take the ordinal number sequence 
arbitrarily far (ibid., p. 58). 


But the real question here is whether later set theorists were able to 
implement Cantor’s intuitive ideas so as to turn what Hallett guard- 
edly refers to as “most of the way” into all the way to Cantor’s 
implicitly “standard” way of thinking about sets and their existence. 
The right answer to this question is a resounding no. 

One way of looking at contemporary axiomatic set theory is to 
view it as an attempt to deal with such higher-order entities as sets 
on the first-order level. What inevitably gets lost in such an attempt 
is precisely the possibility of a standard interpretation of variables 
ranging over higher-order entities. This problem manifests itself, 
among other things, in the form of the Skolem paradox. Even when 
you seem to be able to prove the existence of all sorts of uncount- 
able sets in your axiom system, it turns out that there are, according 
to the downward L6wenheim-Skolem theorem, models of the same 
axiom system in which those very sets are countable. Indeed, the 
very paradox about the Lowenheim-Skolem theorem lies in the fact 
that what in the system is called countability or uncountability is that 
only on the standard interpretation of set variables understood as 
higher-order ones. 

It does not automatically help, either, to move to a higher-order 
level: higher-order logics can typically be treated as if they were 
many-sorted first-order logics as long as the standard interpreta- 
tion is not assumed. Thus, in a sense it is the standard versus non- 
standard interpretation that constitutes the real watershed between 
first-order logic and higher-order logics. At least this distinction is 
much more consequential than a mere distinction between types 
(order levels). Since Frege’s logic was a higher-order one, we are 
already beginning to see that the standard versus nonstandard dis- 


tinction puts ais entire project into a sharp new light. 
. THE IMPORTANCE OF THE DISTINCTION 


We called the HEOL versus nonstandard distinction momentous. 
This was in reality an understatement. Each horn of the dilemma 
turns out to be very sharp, in that the prospects of higher-order 
logic and of its applications depend essentially on one’s choice be- 
tween the two kinds of interpretations, in more than one way. 

For one important thing, second-order logic with a standard in- 
terpretation is easily seen to be semantically incomplete. In other 
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words, the set of valid formulas is not recursively enumerable on this 
interpretation. There cannot exist a finite (or recursively enumer- 
able) set of logical axioms and inference rules that would generate 
all and only valid formulas as theorems. In contrast, Henkin showed 
that, on one especially natural nonstandard interpretation, higher- 
order logic does after all admit of a semantically complete axiomati- 
zation (op. cit., p. 81). Many other kinds of nonstandard interpreta- 
tion likewise yield axiomatizable logics.” This does not mean that we 
should prefer a nonstandard interpretation. For only the standard 
interpretation of second-order logic enables us to use it for the most 
important purposes it can serve in the foundations of mathematics. 
If we assume the standard interpretation, we can easily formulate 
descriptively complete and indeed categorical axiomatizations for 
such crucial mathematical theories as number theory and the theory 
of real numbers.® Likewise, the principle of mathematical induction 
is easily formulated in a second-order logic with standard interpre- 
tation, but cannot be formulated in an axiomatizable nonstandard 
higher-order logic.® Thus, for most serious foundational purposes 
we have to assume the standard interpretation of second-order 
logic.!° 

ney THE ROLE OF THE DISTINCTION IN FOUNDATIONAL DISCUSSIONS 
Even though the standard versus nonstandard distinction has not 
attracted much attention from philosophers, it has played an impor- 
tant, albeit only partially articulated, role in the actual foundational 
discussions. For instance, the standard versus the nonstandard con- 
trast is the most important determinant of a mathematician’s atti- 
tude to the axiom of choice. On the standard interpretation, the 


7 See also Hintikka, “Standard vs. Nonstandard Logic: Higher Order, Modal 
and First-Order ics,” in Modern Logic: A Survey, E. Agarzi, ed. (Boston: 
Reidel, 1980), pp. 283-96. 

* Henkin, p. 89. See also Steward Shapiro, Foundations without Foundation- 
alism (New Por: Oxford, 1991). 2 

? For this point, see iro, and also Thoralf Skolem, “Uber emige Grund- 
lagenfragen der ” Skrifter utgitt av det Norske Videnskaps-Aka- 
demi, 1, 4 (1929): 1—49. 

© The whole business of axiomatizability is nevertheless extremely subtle. Per- 
baps the most interesting nonstandard interpretation of first-order logic is ob- 
tained by restricting Skolem functions to recursive ones, somewhat as in Gddel’s 
functional interpretation, but without any unusual red deena of negation and 
conditionals. Then the resulting logic is nonaxi irable, as was first pomted out 
by Robert Vaught in “Sentences True in All Constructible Models,” The Journal 
of Symbolic Logic, xxv (1960): 39-53. The Peano axiomatization of elementary 
arithmetic is then complete, however, as was ae eae out by Tennenbaum. 
For this later point, see Richard Kaye, Models of Peano Arithmetic (New York. 
Oxford, 1991), esp. pp. 153-7. 
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axiom of choice is trivially acceptable, whereas for an advocate of a 
nonstandard interpretation, there is prima facie little reason to be- 
lieve in the axiom of choice. This important background of-the ca- 
reer of the axiom of choice is almost universally neglected.!! 

Likewise, the standard versus nonstandard distinction prejudices 
one’s judgment of Bertrand Russell’s axiom of reducibility, which 
roughly says that for each impredicatively defined set there exists an 
equivalent predicatively definable class. For a believer in the stan- 
dard interpretation, a class is there, determined by its members, no 
matter how it is defined in some one language. Hence, for the theo- 
rist of the standard sort, the “axiom of reducibility”’ is trivially true, 
indeed redundant, as was forcefully pointed out by Ramsey.’ In 
general, Ramsey’s revision of Russell’s ramified theory of types is to 
all intents and purposes a self-conscious step from a nonstandard to 
a standard higher-order logic. 


VI. STANDARD INTERPRETATION 
AND THE IDEA OF ARBITRARY FUNCTION 


We can say that there were two long-range developments of mathe- 
maticians’ thinking about the foundations of their own field which 
led to the idea of standard interpretation: 


G) The development of the idea of an arbitrary function. 
(ii) The development of the idea of arbitrary set. 


Thinking about (i) and (ii) was conditioned not so much by philo- 
sophical questions as by the demands of mathematical theorizing 
itself. Let us turn first to the notion of arbitrary function. 

The development of the idea of an arbitrary function is one of the 
most important conceptual developments in mathematics. This de- 
velopment has not always been fully appreciated. For instance, the 
concept of arbitrary function is often credited to P. G. L. Dirichlet, 
even though in reality it was formulated as early as 1755 by L. Euler. 
In his Institutiones calculi differentialis, Euler wrote: 


If some quantities so depend on other quantities that is the latter are 


changed the former undergo change, then the former quantities are 
called functions of the latter. This denomination is of the broadest 


nature and comprises every method by means of which one quantity 


11 This is the case even with Gregory Moore's i rtant book, Zermelo’s Ax- 
tom of Choice: Its Origins, Development and Inlusnce (New York: Springer, 
1982). 

12 “The Foundations of Mathematics,” Proceedings of the London Mathemats- 
cal Soctety, 1, 25, (1925): 338-84. 
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could be determined by others. If, therefore, x denotes a variable 
quantity, then all quantities which depend upon x in any way or are 
determined by it are called functions of it.!8 


Similar quite general characterizations of a function are found in 
other mathematicians, for instance in N. I. Lobachevsky. According 
to I. H. Anellis,'* Lobachevsky thought that “general thought de- 
mands that a function [f] of x be called a number which is given for 
each x and which changes ... together with x.” The case of Loba- 
chevsky is interesting because the development of non-Euclidean 
geometries was one of the factors that prompted mathematicians to 
consider a wider class of structures than before: for instance, be- 
cause not all non-Euclidean geometries can be modeled within the 
Euclidian one. 

In Euler’s time, the concept of arbitrary function had little use in 
state-of-the-art mathematical research. Actual work both in the 
foundations of mathematics and at the cutting edge of the subject 
gradually raised the consciousness of most mathematicians in the 
course of the nineteenth century. The innocent looking Eulerian 
definition of an arbitrary function turned out to admit of surpris- 
ingly unruly denizens of the world of mathematics. For instance, 
many of the results that had been taken more or less for granted 
turned out to hold only for functions with specifiable properties. 
For instance, it turned out that even a continuous function need not 
be differentiable. This development enhanced, on the other hand, 
the interest of such results as could be proved for arbitrary func- 
tions or at least for a large class of functions. Important examples 
are provided by functions that can be represented by means of 
Fourier series and by functions that are integrable, on one of the 
various extended definitions of integration. 

As a consequence of such developments, a lively interest arose in 
the concept of function and in the question as to how wide a class of 
functions (possibly including all arbitrary functions) mathematicians 
should be concerned with. The acceptance or rejection of the con- 


13 This passage from Institutiones calculi differentials is translated by A. P 
Yushevich: “The Concept of Function up to the e of the 19th,” Arch. Hisi. 
Exact. Sci., xvi (1970), p. 70. For additional information on the development of 
the idea of an arbi i see also Umberto Bottazzmi, The Higher Cal- 
culus: A History of Real and Complex Analysis from Euler to Weierstrass, 
(New York: Springer, 1986). 

1 A History of Mathematical Logic in Russia and the Soviet Union, unpub- 
lished. 
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cept of arbitrary function was, for such reasons, an important water- 
shed in the foundational discussions in the nineteenth century. 

The tension generated by this question is in evidence in several 
leading mathematicians. For instance, K. T. Weierstrass considered 
the idea of arbitrary function, but rejected it as being so vague and 
so general as not to enable us to find any interesting. things to say 
about it as such. The other side of the Weierstrassian idea of func- 
tion, however, was his desire to consider as large a class of functions 
as possible, to find “the largest class of functions for which one can 
give an analytic representation and which can most fully satisfy the 
needs of analysis.” !” 

L. Kronecker, in contrast, rejected clearly the idea of an arbitrary 
function. As Harold M. Edwards’® puts it, “what is missing, and 
what is excluded by his [Kronecker’s] principles is integrals of ‘arbi- 
trary’ functions or sums of ‘arbitrary’ infinite series” (ibid., p. 74). 

All these developments are highly relevant here, for the concep- 
tion of the standard interpretation of higher-order quantifiers (and 
variables) is, to all purposes, equivalent with the notion of an arbi- 
trary function or the notion of an arbitrary set, as was indicated in 
section I above. For the same reason it is not anachronistic to relate 
foundational discussions of the late nineteenth century to the stan- 
dard versus nonstandard distinction, even though it entered into 
actual discussion disguised by a different terminology. 

VI. PRIMACY OF CONCEPTS OVER THEIR EXTENSIONS IN FREGE 
After all these preparations, we can turn to the question whether 
Frege assumed the standard interpretation of higher-order quanti- 
fiers or a nonstandard one. This question is especially poignant 
when applied to Frege, for his logic contains explicitly formulated 
higher-order quantifiers. We shall show that Frege lacked both the 
idea of arbitrary function and the idea of arbitrary set, and hence in 
effect opted for a nonstandard interpretation of higher-order quan- 
tifiers. Let us deal first with the latter question. 

For Frege, classes as classes come into play only through being 
Wertverlaufe of functions (in Frege’s wide sense of the term ‘func- 
tion’). He distinguished very clearly a concept, denoted by a concept 
word ‘&(£)’ from its extension, denoted by a course-of-values-name 


18 See P Dugac, “Eléments d'analyse de Karl Weierstrass,” Archive for History 
of Exact Sciences, x (1973): 41-176. 

16 “Rronecker’s view on the Foundations of Mathematics,” in The History of 
Modern Mathematics, Vol. I: Ideas and Their Reception, David E. Rowe and 
Jobn McCleary, eds. (San Diego: Academic, 1989) 
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‘e®(e)’.!7 As is well-known, one basic difference in Frege between a 
concept and the extension corresponding to it is that the latter is an 
object whereas the former is not. A concept is a function whose 
value is always a truth value. The extension of the concept, or alter- 
natively the course of values of the function which the concept is, is 
an object that is determined by the arguments and the correspond- 
ing values of the function for these arguments. It is important to 
keep in mind that both concepts and courses of values are exten- 
sional (ibid., pp. XXXIX, 36). 

The relation between a concept and its course of values is, con- 
trary to what one might think, logical and not empirical. That is, the 
extension of a concept can only be apprehended by our logical facul- 
ties starting out from the concept.'® Accordingly, a course of values 
does not exist independently, but only tied to a concept (function) 
which is logically more primary than it: 


... In fact I do hold that the concept is logically prior to its extension, 
and I regard as futile the attempt to base the extension of a concept or 
a class not on the concept but on individual things.” 


Consequently, we have no means of recognizing a course of values 
except through a concept: 


We have only a means of always recognizing a course-of-values tf it is 
designated by a name like “(e)”, by which it is always recognizable as 
a course-of-values. But we can neither decide, so far, whether an object 
is a course-of-values that is not given us as such, and to what function it 
may correspond, nor decide in general whether a given course-of- 
values has a given property unless we know that this property is con- 
nected with a property of the corresponding function.” 


It has been argued by Tyler Burge*! that, in framing his conception 
of class, Frege was the member of two traditions. The first was the 
intensional tradition originated by Leibniz and continued by J. S. 
Mill, G. Boole, and E. Schröder. Roughly, the main conception em- 


11 Frege, The Bastc Laws of Arithmetic: Exposition of the System, Montgo- 
mery Furth, trans. (Berkeley: California UP, 1967), p. 36. 

18 Frege, Posthumous Writings, H. Hermes, ed , P. Long and R. White, trans. 

idge: Blackwell, 1979), p. 181 

19 Frege, “A Critical Elucidation of Some Points in E. Schroeder’s Algebra der 
Logtk,” in Translations from the Philosophical Writings of Gottlob Frage, P 
Geatch and M. Black, eds. (New York: ord, 1952), pp. 86-106, esp p. 106. 

© Frege, The Basıc Laws of Arithmetic, p. 46. 

1 “Frege on Extensions of Concepts, From 1884 to 1903,” Philosophical Re- 
view, LXO (1984): 1—34. 
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bodied in this tradition was that intension, in the form of properties, 
characteristic marks, etc., determine extension. The second tradi- 
tion was the extensional one represented, among others, by Cantor, 
Richard Dedekind, and others. The main feature of this approach 
was that sets and numbers are not defined by appeal to properties, 
concepts, etc., as in the intensional tradition, but by ‘‘abstraction” 
(cf. section x below). 

In his definition of a class, Frege adopted the former conception 
and he continuously criticized the extensionalists’ idea that a class is 
formed by grouping together elements. It is important to realize 
that this antiextensionalism is compatible with Frege’s view that the 
identity. conditions for concepts are extensional. Frege’s “‘intension- 
alism”’ (i.e., his view that concepts are logically prior to their exten- 
sions) can be seen in different ways. For instance, in his letter of July 
24, 1902, Russell wrote Frege that he had difficulties “to see what a 
class really is if it does not really consist of objects but is nevertheless 
supposed to be the same for two concepts with the same ex- 
tension. ”?? 

Frege’s answer is contained in his letter to Russell of July, 28, 
1902. It is worth spelling out in some detail. He distinguished 
wholes (systems) from classes, using the following examples, among 
others: 


(1) The Romans conquered Gaul. 
(2) The class of prime numbers comprises infinitely many objects. 


In the first case, according to Frege, the Romans form a whole, that 
is, a physical body held together by relations that are essential to it, 
such as institutions, customs, laws. It is characteristic of wholes, 
Frege writes, that the parts of any part of a whole are again parts of 
the whole. Thus, the parts of the regiments are the battalions, but 
also the companies and the soldiers. 

This is not so for classes, Frege avers. A class is determined by its 
members. The only members of the class of prime numbers are the 
prime numbers. The members of a class of regiments include only 
the regiments, not any individual soldiers. A class is a logical object, 
not a physical object like a whole (system). Asked how we apprehend 
classes, Frege gave the following unambiguous answer: 


We apprehend them as extensions of concepts, or more generally, as 
ranges of values of functions (ibid., p. 141). 


mr , Philosophical and Mathematical Correspondence, Gottfried Gabriel 
et al. ae (Cambridge Blackwell, 1980), p. 139. 
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(Here and below, “ranges of values of functions” is synonymous 
with “courses of values of functions.’’) 

This is not supposed to be interpreted as the priority of concepts 
only in explaining or understanding what classes are, but as an onto- 
logical principle as well: 


... for me the extension of a concept or a class is only a special case of 
a range of values (ibid., p. 142). 


Thus, a class which is not the extension of some concept is a contra- 
diction in terms within the Fregean system, since every class has to 
be introduced via a concept. A class cannot be given through its 
elements, only through a rule that determines elementhood.* 

Now, there were at least two different reasons for Frege to reject 
the individuation of classes through their members. The first one is 
connected with the definition of the empty class: 


Of course, one must not then regard a class as constituted by the 
objects (individual, entities), that belong to it; for removing the objects 
one would then also be removing the class constituted by them. In- 
stead, one must regard the class as constituted by the characteristic 
marks, i.c., the properties which an object must have if it is to belong to 
it. It can then happen that these properties contradict one another, or 
that there occurs no object that combines them in itself. The class is 
then empty but without being logically objectionable for that reason.* 


The second reason has to do with the individuation of infinite 
classes. Frege thought that an infinite class cannot be given solely by 
its members because of the finiteness of the human intellect. In- 
stead, such a class can be justified only as derived from a concept 
involved in thought.” 
VUL FREGE LACKED THE NOTION OF ARBITRARY SET 

The intensionalists’ and, in particular, Frege’s conception of set ex- 
plained above was criticized by the so-called extensionalists, that is, 
set theorists like Cantor and Dedekind who shared with each other a 
different concept of set. Their main criticism against Frege was that 
he allowed an extension for every concept, which led to Russell’s 
paradox. For instance, in his review of Frege (1884), Cantor? criti- 


B See Frege, Posthumous Writings, p. 84; and The Foundations of Arithme- 
tic: A Logico-Mathematical Inquiry tnto the Concept of a Number, J. L. Austin, 
trans (Cambridge: Blackwell, 1960). 

* See the undated letter to Peano, in Frege, Correspondence, p. 109 

™ Posthumous Writings, p. 34; and Foundations, p. 98. 

* “Rezension der Schrift von G. Frege Die Grundlagen der Arithmetik,” in 
Gesammelte Abhandlungen mathematischen und philosophischen Inhalts, E. 
Zermelo, ed. (Berlin: Springer, 1932) 
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cized Frege for defining classes as extensions of concepts. He 
pointed out that, in general, “the ‘extension of a concept’ is some- 
thing quantitatively completely undetermined” and for this reason 
not all extensions of concepts are sets” (ibid., pp. 440-1). 

In the same spirit, E. Zermelo?” wrote: 


... it follows further as already appears in a much simpler way from 
Russell’s antinomy, to be sure that it is not permissible to treat the 
extension of every arbitrary notion as a set and therefore that the 
customary definition of a set is too wide (ibid., p. 107). 


Even if Frege’s definition of a set was too wide in one direction, in 
another direction it was too narrow. The inseparable Fregean link- 
age between sets and their defining properties prevented him from 
generating new sets by operations acting solely on the members of 
the already existing sets. This fact is correctly emphasized by Hao 
Wang” when he writes: 


He [Frege] identifies sets with their extensions and treats them as indi- 
viduals (on the same level with individuals). This suggests immediately 
the idea of a type hierarchy of extensions, since extensions of predi- 
cates seem to be more closely related to predicates than to individuals 
.... But such a conception has little that is positive to say about how 
sets are to be generated (ibtd., p. 210). 


Frege was thus unable to operate with arbitrary extensions as Can- 
tor and Dedekind did. What is more, he tied extensions so closely to 
concepts that he could not reach the modern (generative) concept 
of set which or more or less completely implemented by the axioma- 
tized set theories of Zermelo, J. von Neumann, and others. Al- 
though in these theories strong limitations on the formation of sets 
still exist (as compared with Cantor’s system), still one has some 
liberty, starting with a given set, to form new sets. Thus, one basic 
ingredient of the standard interpretation, i.e., the notion of arbi- 
trary set, is lacking from Frege’s system. 

One might still claim that, although Frege could not form arbi- 
trary sets merely by applying certain operations on the members of 
an already existing set, there is nothing in Frege’s theory which 
prevents the arbitrary sets to exist as value ranges of some suitable 
concepts. Hence, for Frege, the question whether the standard in- 


Y “Neuer Beweis für die Möglichkeit einer Wohlordnung,” Mathematische 
Annalen, LXV (1908): 107-28. 
™ From Mathematics to Philosophy (New York: Routledge, 1974). 
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terpretation is correct becomes simply and solely the question 
whether there is a concept (function) for each potential extension, 
that is, whether for each class (class-in-extension) there is a concept 
whose course of values this class is. Thus, the answer to the question 
“Which of the two interpretations Frege was committed to?” de- 


pends crucially on Frege’s conception of function. 
IX. THE OBJECTIVE EXISTENCE OF FUNCTIONS 
DOES NOT IMPLY STANDARDNESS 


Michael Dummett®™ claims unambigously, not only that Frege oper- 
ated with higher-order quantification, but that for Frege the range 
of a function quantifier included all “arbitrary functions”: 


For Frege had not the slightest qualm about the legitimacy or intelligi- 
bility of higher-level quantification: he used it from the first, in Be- 
griffsschrift, freely and without apology, and did not even see first- 
order logic as constituting a fragment having any special significance 
(ibid., p. 223). 


And it is true enough, in a sense, that, once we know what objects there 
are, then we also know what functions there are, at least, so long as we 
are prepared, as Frege was, to admit all ‘arbitrary fumctions’ defined 
over all objects (ibid., p. 177). 


That Frege can give no consistent justification for interpreting second- 
level quantification as varying over the full impredicative totalities of 
first-level concepts, relations and functions is, of course, no ground for 
denying him the right to employ second-level quantification at all 
(ibid., p. 541). 


Dummett does not provide any references to Frege’s texts in sup- 
port of his claims. It is nevertheless not difficult to see what he has 
in mind. On page 177, Dummett is expounding Frege’s idea of the 
objective existence of functions, his ‘conception of the function as 
belonging to the ‘realm of reference’.” He points out that ‘‘for 
Frege the only possible route to grasp what a function is is via an 
understanding of the working of functional expressions.” But this 
does not mean, Dummett in effect avers, that functions according to 
Frege are merely counterparts to functional expressions, that they 
are not inhabitants of the real universe. Dummett is undoubtly 
correct here. But what he is doing is merely a way of asserting the 
objective, nonlinguistic existence of functions independently of lan- 
guage. They are, in Dummett’s words, mappings of objects on ob- 


P Frege’s Philosophy of Language (London. Duckworth, 1973). 
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jects. Hence Dummett’s real point is that Frege maintains the non- 
linguistic existence of functions. Nothing follows from this point 
concerning Frege’s alleged recognition of arbitrary function, i.e., of 
any possible correlation of objects and arguments. Dummett’s move 
from the mode of existence of functions as objective correlations of 
objects to other objects to the claim that there was such a correla- 
tion or mapping for each possible pairing of argument values and 
function values is a non sequitur. He is apparently assuming that 
there is only one possible nonstandard interpretation, the one Hen- 
kin later considered. For only on this assumption could Dummett 
infer from Frege’s rejection of this particular nonstandard interpre- 
tation that Frege accepted the standard one. At best, Dummett 
manages to refute Carnap’s view (see note 5 above) that, according 
to Frege, only provably existing functions really exist. 

The other quotes from Frege which Dummett uses are not any 
more convincing. On the contrary, on page 541, Dummett even 
admits that Frege’s “theory of reference cannot of itself, provide 
any temptation to violate the vicious-circle principle,” that is, to 
assume the existence of impredicative totalities. But the very point 
of the standard interpretation is to assert the existence of every 
possible class independently of its characterization in language. 
Hence the standard interpretation would in fact provide a tempta- 
tion of the very kind Dummett denied Frege.” 

If Dummett credited Frege with the notion of arbitrary function, 
Nino Cocchiarella”’ credits him with no, more and no less than the 
formulation of standard second-order logic: 


. both Frege and Russell maintain that the logistic framework within 
which classical mathematics is to be represented must consist at least of 
a second order predicate logic where quantification is not only with 

~ respect to the role of singular térms, but to that of predicates as well. 
Indeed, it was Frege himself who first formulated and developed stan- - 
dard second order predicate logic, and he did so precisely as a frame- 
work within which classical mathematics was tobe reduced to logic 
(thid., p. 198). 


No reference to Frege’s texts is provided by Cocchiarella. 


* In fact, the only axioms of set theory which give us sufficient liberty in form- 
ing more comprehensive sets are the axiom of subsets and the power-set axiom; 
which, as is well-known, involve impredicativites.. 

» “Frege, Russell and Logicism: A Logical Reconstruction,” in Frege Synthe- 
sized, L. Haaparanta and J. Hmtikka, eds. (Boston Reidel, 1986), pp. 197-252 
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It is true that Frege used higher-order logic as a framework for 
reducing mathematics to logic, and that this can be done only by 
giving higher-order logic the standard interpretation. But there is 
no evidence in Cocchiarella or anywhere else that Frege actually 
perceived the need of the standard interpretation for this purpose 
or actually assumed it. 

X. FREGE'S CRITICISMS OF “ABSTRACTION” 

What made possible the conception of an arbitrary set was the grad- 
ual disentanglement of the notion of set from intensional ingre- 
dients such as concepts, properties, etc., and the definition of set- 
hood in an alternative way. One such alternative way attempted by 
some foundationalists consisted in thinking of classes as mere collec- 
tions of objects obtained by ‘‘abstraction.” For instance, abstraction 
was typically the procedure through which Cantor claimed he could 
reach his notion of cardinal number which he defined as ‘‘the gen- 
eral concept which arises from the set M when we abstract both 
from the nature of the various elements m and from the order in 
which they are given” (op. cit., p. 282). But abstraction was not, as 
such, limited only to the explanation of what numbers are but could 
also be used to explain what classes are. Thus, Edmund Husserl"? 
argued that a collection (totality) consists of contents that are to be 
thought without any relation or connection between them. 

Frege criticized consistently and constantly other logicians’ and 
mathematicians’ attempts to introduce arbitrary classes by abstrac- 
tion. For instance, he criticized Husserl for mistakenly defending a 

"notion of totality in which “the contents are merely supposed to be 
thought or presented together, without any relation or connection 
whatever being presented between them.” For his part, Frege con- 
fesses that he is unable ‘to disregard the connection without losing 
the whole” (#bid.).. 

Frege’s forceful criticisms of abstraction have usually been taken 
to be merely a part of his overall campaign against psychologism in 
logic and mathematics.” This is.a dangerous oversimplification. 
Frege’s arguments, no matter how they were formulated, had a di- 
rect impact on the understanding of his substantial views of logic. 


” Philosophie der ene ie oe und logische Untersuchung, 
Vol. I (Halle ad. Saale: Pfeffer, 1891), p. 79 

8 “Review of Dr. Husserl’s Philosophie der Artthmetik,’’ Mind, LXXX (1972), 
p 329. ; 

M For some of Frege’s criticisms against psychologism, see thid., pp. 321-37; 
Foundations, p 63; Collected Papers in Mathematics, Logic and Philosophy, 
Brian McGuinness et al., eds. (Cambridge: Blackwell, 1984), pp. 178-81. 
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They are, in effect, arguments for the primacy of concepts over 
their potential extensions. Hence they have an indirect impact also 
on his views on higher-order logic. They show that Frege did not 
appreciate the notion of arbitrary set, and they show once more how 
unwilling he was to accept what later was to be called the standard 
interpretation. 
XI THE TWO FACETS OF EXTENSIONALITY 
AND THE PROBLEMS OF SET THEORY 

Mathematically and logically, the most important target of Frege’s 
criticisms of abstraction was undoubtedly Cantor. A delicate touch is 
needed, however, in discussing the conceptual history of set theory 
and its relations to Frege’s ideas. Earlier in this paper, we pointed 
out the deep kinship between the standard (‘‘extensional”’) interpre- 
tation of higher-order logics and the intuitions on which set theory 
was originally based. This standardness, we pointed out, was lost in 
the course of the development of the contemporary (first-order) 
axiomatic set theory. 

Yet a perspective that at first sight might appear to be diametri- 
cally opposite to ours has been presented and ably argued for by 
Hallett (ep. cit.). In his story, set theory was originally afflicted with 
a serious case of intensionalism. This malady was gradually cured in 
the course of the development of axiomatic set theory, which slowly 
led to the all-too-familiar cumulative conception of sets. According 
to Hallett, Frege was on the side of extensionalist angels, and not 
the bogeyman that he may look like in our scenario. Indeed, Hallett 
(op. ctt., p. 46) brackets Frege in so many words with those nine- 
teenth-century logicians and mathematicians, exemplified by Boole 
and Dedekind, who “operated with arbitrary extensions.” 

Furthermore, according to Hallett, the liberation of set theory 
from the dominance of the intensionalist conception was a painful 
and slow process. This can be seen from the fact that ingredients of 
the intensionalist conception are plenty im Cantor and still, surpris- 
ingly, in Zermelo. For instance, in early Cantor a set was defined as a 
“totality of elements which can be united to a whole through a 
law.’ Also, in Zermelo’s formulation of the subset (separation) 
axiom, what yields the subset is a ‘‘property.”™ It is only with Sko- 
lem that set theory was definitively purged of intensional entities 
like properties, which were replaced by predicates of the given . 


language.’ 
% “Über unendliche, lineare Punk igfaltigkeiten 5,” in op. cit 
% See Zermelo, p. 183; and Hallett, p 267. 
Y A more detailed description of ’s ideas is contained in Hallett, p 281 
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It is nevertheless easy to reconcile Hallett’s perspective with ours. 
What has happened is that Hallett tells only a half of the full story, 
whereas we are in this paper focusing on the other half. We have 
been asking: Does there exist, according to Frege or whoever else, a 
concept for each potential extension? Hallett is restricting his atten- 
tion to an entirely different and in fact reverse question: Does there 
exist an extension for each concept? As was pointed out above, 
both questions have sometimes been discussed under the undif- 
ferentiated heading of “‘extensionalism,”’ and even assimilated to 
each other. 

In the course of the actual development of set theory, it was the 
second problem that was gradually forced upon mathematicians and 
logicians. This happened largely as a response to the paradoxes of 
set theory, which showed the inconsistency of certain kinds of as- 
sumptions of the existence of sets (corresponding to apparently irre- 
proachable concepts). The result was a preoccupation with the con- 
ditions of the existence of extensions, given the corresponding con- 
cepts. What Hallett argues is that the development of the cumulative 
concept of set was looked upon as a process of liberalization of such 
existence conditions. We have no quarrel with his argument. 

Yet, as far as the other problem is concerned, the upshot of the 
development of axiomatic set theory was to give up the standard 
interpretation and hence to give up Cantor’s original vision of set 
theory. Even set theorists like von Neumann, who were beginning to 
see the distance between Zermelo’s axiomatic system and its Cantor- 
ian ancestor and who tried to construct a richer set-theoretical uni- 
verse, could never return to Cantor’s conception of arbitrary set. 
This fact is pointed out by A. Fraenkel and Y. Bar-Hillel” when they 
discuss von Neumann’s system: 


. . . The characteristic property of a function, namely being determined 
by the course of its values, is expressed by an appropriate axiom, . . . it 
states that functions with the same course of values are equal—but not 
that to each course of values a function is assigned, which would mean 
a return to Cantor’s conception of an arbitrary set (ibid., p. 99; italics 
added). 

Thus, it is important to realize that Hallett’s is not the only relevant 


perspective here nor, we believe, the most important one. What 
happens in type-theoretical higher-order logic is that the Gordian 


a 
= Foundations of Set Theory (Amsterdam: North Holland, 195 foo, Gi u 
sects 6-7. as d 


in 
~~ 
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knot of paradoxes is cut open once and for all, not unraveled by 
means of sophisticated axioms. The type distinctions guarantee that 
the existence of the appropriate extension for each concept of the 
same type can never cause any problems. The entire attention 
switches to the problem of capturing those potential extensions 
which prima facie are not extensions of any one concept. Probably 
the simplest way of accomplishing this task is to adopt a nonstandard 
interpretation. Piecemeal adoption of new first-order (or nonstan- 
dardly interpreted higher-order) axioms will not make essential dif- 
ference. What this means for someone like Frege (for whom sets 
exists only as extensions of the corresponding concepts) is that an 
important problem here is to guarantee the existence of suitable 


concepts. 
XII FREGE AND THE TWO FACETS OF EXTENSIONALITY 


In the light of these observations, you can now see that Hallett’s 
perspective is perfectly compatible with ours, as long as it is under- 
stood that he is dealing with only one kind of problem in the foun- 
dations of set theory. Indeed, we can now appreciate better the 
discussions in which Frege was involved. 

Hallett is entirely right in emphasizing that Frege wanted to as- 
sume an extension (course of values) for each and every function (in 
Frege’s sense), including every concept. In fact, that assumption is 
built into the very notation of the Grundgesetze. This assumption 
was dragged out to the open and criticized by Cantor, who declares 
it “an unhappy idea”: 


. to take “the extension of a concept” as the foundation of the 
number concept. He [Frege] overlooks the fact that in general the 
“extension of a concept” is something quantitatively undetermined. 
Only in certain cases is the “extension of a concept” quantitatively 
determined; then it surely has, if it is finite, a definite natural number, 
and if infinite, a definite power [Machtigkeit, i.e., cardinality]. For such 
quantitative determination of the “extension of a concept”, the con- 
cepts “number” and “power” must previously be already given from 
somewhere else, and it is a reversal of the proper order when one 
undertakes to base the latter concepts on the concept “extension of a 
concept” (op. cit., p. 440). 


Here we see both the fact that Frege was indeed on the side of the 
“extensionalists” in the issue on which Hallett is concentrating. At 
the same time, the quote supports our thesis of the priority of con- 
cepts for Frege, in contradistinction to the priority of classes (exten- 
sions) for Cantor. 
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But if so, is there any trace in Frege of the real problems he has to 
face? Does he realize that, if he prioritizes concepts over their ex- 
tensions, he has to provide enough concepts (and thereby enough 
sets) to satisfy the existential needs of actual mathematical theoriz- 
ing, among other things to satisfy the needs of set theory? At the 
first sight, Frege does not evince any awareness of such problems. In 
a deeper sense, however, there are signs of a such awareness in 
Frege. Indeed, the very priority of concepts over their extensions 
provides a clue. For one does not normally speak of assuming the 
existence of a concept but of defining the concept. Frege clearly 
thinks that a definition can bring new sets into a logician’s reason- 
ing. This is seen from his discussion in Grundlagen, section 88, of 
the analytic versus synthetic distinction and its relation to defini- 
tions. The upshot of this discussion is that what Frege calls defini- 
tions can extend our knowledge and hence be synthetic in Kant’s 
sense. Admittedly, Frege says that the consequences of such defini- 
tions can be proved by purely logical means and hence are analytic 
in his sense. But here we must remember that for Frege logical 
truths are simply the most general truths about this world. There- 
fore, they presumably can involve existential truths. It seems to us 
that definitions are for Frege a way of eliciting the existential impli- 
cations of what for him were purely logical truths. 

This is strongly suggested in Frege’s own illustration: 


If we represent the concepts (or their extensions) by figures or areas in 
a plane, then the concept defined by a simple list of characteristics 
corresponds to the area common to all the areas representing the de- 
fining characteristics; it is enclosed by segments of their boundary 
lines. With a definition like this, therefore, what we do—in terms of 
our illustration—is to use the lines already given in a new way for the 
purpose of demarcating an area... But the most fruitful type of defini- 
tion is a matter of drawing boundary lines that were not previously 
given at all. What we shall be able to infer from it, can be inspected in 
advance; here, we are not simply taking out of the box again what we 
have just put into it (Foundations, pp. 100-1). 


But if so, we can predict what the focus must have been for Frege’s 
doubts about the way in which Cantor and others were introducing 
the sets they needed in their theories. The prediction is that Frege 
must have been worried about the way these logicians and gentle- 
men defined their basic concepts. This would-be prediction is in- 
deed fulfilled. For Frege’s criticism of Cantor is leveled prominently 
at his use of definitions. This criticism is found in an especially sharp 
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formulation in Frege’s originally unpublished review of Cantor.” It 
involves prominently an attack on Cantor’s notion of abstraction, 
thus indicating its tie to Frege’s rejection of the standard interpre- 
tation. 

The relation of Frege’s notion of definition to existence assump- 
tions deserves a fuller examination than we can give it here. But 
even without such a further investigation, his discussion of other 
logicians’ and set theorists’ definitions is suggestive of this half- 
awareness of the crucial (for him) problem of the existence of con- 
cepts. Unfortunately, this problem was swept out of the focus of 
Frege’s attention by the more immediately urgent problem of the 


existence of extensions for (so to speak already existing) concepts. 
XUL “CONSISTENCY DOES NOT IMPLY EXISTENCE” AND 
THE STANDARD VERSUS NONSTANDARD DISTINCTION 


There are other ways of seeing that Frege was inextricably commit- 
ted to a nonstandard interpretation. On several occasions, Frege 
bases his arguments on the principle that we must not infer exis- 
tence from consistency (freedom of contradiction). For instance, he 
writes: 


Suppose we have proved that in an equilateral, rectangular triangle the 
square on the hypotenuse is twice as large as the square on one of the 
sides, which is easier than proving Pythagoras’ theorem in general. We 
can now conclude that the proposition does not contain a contradic- 
tion either within itself or in relation to the geometrical axioms. But 
can we conclude further: therefore, Pythagoras’ theorem is true? I 
cannot accept such a method of inference from lack of contradiction 
to truth.” 


The principle that Frege rules out appears to be what would be 
needed to guarantee that there is a concept for any class (possible 
extension). What this possibility means in this case is that, if a con- 
cept should somehow be introduced which actually has this class as 
its extension, no contradiction would come about. Hence the for- 
bidden principle could guarantee the existence of the ‘‘missing” 
concept. 

Admittedly, Frege does not in so many words apply his principle 
“existence does not follow from consistency” to deny that there isa 
concept for each class, having this class as this extension. But the 


=» “Entwurf zu einer Besprechung von Cantors Gesammelten Abhandlungen 
zur Lehre vom Transfiniten,” in Gottlob Frege: Nachgelassene Schriften, H. 
Hermes, F. Kambartel, and F. Kaulbach, eds. (Hamburg: Meiner, 1969), pp. 76— 
80 


i Correspondence, p. 48. 
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variety of different cases to which he does apply his principle 
strongly suggests that he was committed to applying it also so as to 
rule out the standard interpretation. 

In the Grundlagen, Frege admittedly uses his principle only to 
criticize H. Hankel’s somewhat cavalier procedure in extending the 
concept of number by using as his only firm guideline the preserva- 
tion of the main laws governing numbers. It might seem that no 
sweeping conclusions can be drawn from Frege’s facile triumph. But 
surely Frege could and should have tried to criticize more significant 
thinkers than Hankel in the same way. What Hankel does is not so 
entirely different from Dedekind’s procedure. Dedekind, too, is in 
effect relying on the consistency of the result when he introduces a 
real to correspond to any cut, even when it does not correspond to a 
real number. And if Frege had so extended his criticisms to Dede- 
kind, their force would once again come very close to the denial of 


the standard interpretation. 
XIV. FREGE DISAVOWS THE CONCEPT OF ARBITRARY FUNCTION 


The most convincing evidence for ascribing a nonstandard interpre- 
tation to Frege is mediated by the connection pomted out earlier 
between the standard interpretation and the idea of arbitrary func- 
tion. Frege’s conception of function is shown not only by what he 
says about it but also how he applies the concept of function. In his 
early paper, “Methods of Calculation Based on an Extension of the 
Concept of Quality,’*! Frege claims to have proved “some general 
propositions ... valid for all functions and functional systems” 
(ibid., p. 92). Yet in actually proving his results, Frege iş freely 
assuming that the functions he is dealing with are differentiable 
(ibid., p. 60). In other words, Frege’s “all functions” are not all 
arbitrary functions in the usual sense of the word. Frege’s concep- 
tion of function is obviously light years away from the concept of 
arbitrary function in the sense that is related to the standard inter- 
pretation. 

Even more striking than the narrowness of Frege’s-working con- 
cept of a function is his obliviousness to the problems afflicting this 
notion. As was noted earlier, discussions concerning the general 
concept of function were perhaps the most important ingredient in 
the lively foundational discussions among mathematicians in Frege’s 
lifetime. Hence when Frege takes up in so many words the critical 
question “What is a function?’ one might therefore expect some 


41 In Collected Papers, pp. 50-92 
42 Ibid., pp. 985-49 
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awareness of the actual cases related to mathematical research in 
such areas as Fourier analysis. This expectation is not fulfilled. In- 
stead, we find in effect an unmistakeable rejection of the concept of 
arbitrary function in the relevant sense of the word. Frege formu- 
lates, modifying E. Czuber’s definition, the idea of an arbitrary 
function as follows: 


With every number of an x-range there is correlated a number. I call 
the totality of these numbers the y-range (ibid., p. 289). 


Frege calls our attention, quite correctly, to the notion of correla- 
tion involved in this concept of function: 


The heart of the matter really lies in quite a different place, viz. hidden 
in the word ‘correlated’ (ibid.). 
He notes that his opponent leaves the nature of this correlation 
completely open, quoting Czuber as saying that 


The above definition involves no assertion as to the law of correlation, 
which is indicated in the most general way by the characteristic f; this 
can be set up in the most various ways (zbid.). 


Frege immediately focuses on the idea of “law”: 


Correlation, then, takes place according to a law, and different laws of 
this sort can be thought of. In that case, the expression y is a function 
of x’ has no sense, unless it is completed by the law of correlation 
(tbtd.). 


But how can we do that? Frege answers: 


Qur general way of expressing such a law of correlation is an equation 
in which the letter ‘y’ stands on the left side whereas on the right there 
appears a mathematical expression consisting of numerals, and the let- 
ter ‘x’, e.g. ‘y = x? + 8x’ (ibid.). 
Here Frege is not speaking of “correlations” as being exhibited by 
objectively existing entities out there in the Platonic realm of num- 
bers, called functions, but by actual mathematical equations ex- 
pressed in a suitable notation. 

This by itself does not yet imply that Frege rejected the idea of 
arbitrary function. For it leaves open the possibility that laws defin- 
able by equations do not exhaust the range of all functions he is 
countenancing. Helpfully, Frege nevertheless clarifies his position 
immediately. He continues: 


Functions have indeed been defined as being such mathematical ex- 
pressions. In recent times this concept has been found too narrow. 
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However, this difficulty could easily be avoided by introducing new 
signs into the symbolic language of arithmetic (ibid., p. 290). 


This is a mind-boggling statement. It shows Frege’s mind at work. It 
shows that there is no niche in his world for the notion of an arbi- 
trary function in the sense in which Euler, Czuber, and others were 
thinking of this notion. For to think that the range of such arbitrary 
functions could be covered by introducing new signs into the nota- 
tion of arithmetic is not to understand this notion. Hence we can 
safely conclude that, in the logic to which Frege tried to reduce all 
mathematics (except geometry), there is no room for the idea of an 
arbitrary “‘function-in-extension” and hence no room for the stan- 
dard interpretation. 

It is not that Frege accepts uncritically the characterization of the 
largest relevant class of functions by means of equations. Indeed, he 
immediately goes on to criticize the quoted characterization. But 
tbis criticism has nothing to do with the narrowness of the class of 
functions so definable. It deals with the fact that the proposed char- 
acterization obscures the ‘‘distinction between sign and the thing 
signified,” and hence obscures the objective existence of functions 
independently of any particular way they are formally expressed. 

What is interesting about this justified criticism by Frege is that its 
upshot obviously has nothing to do with the range of functions that 
exist or do not exist. Problems of that kind, Frege has already pro- 
claimed, “can easily be avoided by introducing new signs into the 
symbolic language of arithmetic.” Hence Dummett and others are 
simply wrong if they think that the objective existence of functions, 
or even their mode of objective existence as correlations of objects 
with objects, prejudices Frege in favor of the standard interpre- 
tation. 

At the same time, the latest displayed quote from Frege boggles 
the mind in that it demonstrates Frege’s alienation from the actual 
working problems in the foundations of mathematics of his time. 
Frege is in effect telling Dirichlet, Weierstrass, Dedekind, H. 
Poincare, et alia that the problems they have been agonizing about 
could easily by solved by introducing new signs into the notation of 
arithmetic. It is perhaps not surprising that Frege’s foundational 
ideas did not attract more attention among the practicing founda- 
tionalists than they in fact did. 

XV. FREGE’S REDUCTIONIST PROGRAM IS A NONSTARTER 
What have we accomplished so far? And why have we done what we 
have done? Why worry about whether Frege accepted the standard 
or nonstandard interpretation of higher-order logic? After all, the 


314 THE JOURNAL OF PHILOSOPHY 


distinction was articulated much later, and hence Frege could 
scarcely have expected to pay much attention to this distinction, 
which might seem rather pedantic. Yet there is a massive reason for 
raising the question we have raised. The evaluation of Frege’s en- 
tire life work depends crucially on the answer. Frege tried to re- 
duce mathematics to logic. In the process, he used crucially higher- 
order quantifiers. For instance, he defined “The number which be- 
longs to the concept F is oo,” as “there exists a relation which 
correlates one-to-one the objects falling under the concept F with 
the finite Numbers” (Foundations, p. 98). Despite the fact that 
Frege claimed that his terminology “diverges to some extent” from 
Cantor’s, in the Grundlagen he welcomes Cantor’s achievements 
because “they have led to the construction of a purely arithmetical 
route to higher transfinite Numbers (powers)” (ibid., p. 96). It has 
been emphasized in this paper, not for the first time, that the diver- 
gence between Frege and Cantor is much deeper than merely a 
terminological one. The divergence spreads from two radically dif- 
ferent conceptions of a set, one that ties sets to concepts and proper- 
ties, the other that generates sets freely through operations acting 
on the members of some already existing sets. Likewise, it makes 
more than a terminological difference whether one defines a one- 
to-one correspondence between two sets as any arbitrary one-to-one 
function in extension mapping them on each other, as Cantor envis- 
aged it, or as a similar function picked up by a concept, as Frege in 
effect did. 

Also, when Dedekind characterized the real numbers in effect by 
means of the cut principle, which says that every bounded (from the 
above) set of reals has a least upper bound, this characterization is a 
categorical one only if the sets involved are arbitrary ones and not 
only such sets as are courses of values of concepts expressible in the 
language of arithmetic. Thus, there is a deep sense in which Frege’s 
system is not adequate for interpreting results in contemporary set 
theory and mathematical theorizing, for instance in real analysis. 

XVI. FREGE’S PLACE IN HISTORY 
Even more generally, serious questions must be raised in the light of 
our results and of other results about Frege’s stature as a philoso- 
pher of mathematics and as an analyst of the actual working founda- 
tions of mathematics. Frege created our received conception of lo- 
gic, not just in the sense of conceiving the general idea but in the 
sense of actually implementing it. This is a lasting achievement. Yet 
it turns out that his creation, the customary idea of first-order logic, 
is in certain respects too restrictive and -perhaps even arbitrarily 
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restricted. As Hintikka*® has recently shown, Frege and others have 
arbitrarily excluded quantifier independence from their logic. As we 
have seen, Frege’s failure to grasp the idea of the standard interpre- 
tation of higher-order logic turns his entire foundational project 
into a hopeless daydream. Elsewhere, Hintikka“ has pointed out 
how totally blind Frege was to what must be considered the most 
significant line of development in modern foundational studies, the 
gradual systematization of the model-theoretic viewpoint. In particu- 
lar, Frege totally failed to understand the early uses of this viewpoint 
by David Hilbert, especially in Hilbert’s work in the axiomatic foun- 
dations of geometry. 

As Hilbert’s work was easily the most influential kind of founda- 
tional work among mathematicians and scientists in the early de- 
cades of this century, this means that Frege was totally blind to one 
of the most important lines of thought in the foundations of mathe- 
matics in this century. What all of this adds up to is that Frege failed 
to relate his own work to much, not to say most, of the foundational 
work that really mattered to actual mathematical research and to 
mathematicians’ own efforts to clarify the fundamentals of their 
own discipline. It seems to us unfortunate that philosophers habitu- 
ally go to Frege for their problems and for the concepts that could 
help us to cope with them. Frege was far too myopic to be a fruitful 
source of concepts, ideas, and problems. It is likely to turn out that 
some of his contemporaries can offer far better ideas for future 
work in the foundations of mathematics than Frege did. 
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University Press, 1990. xxi, 293 p. Cloth $54.95, paper $17.95.* 


Allen Wood’s latest book is the best existing treatment of Hegel’s 
ethical thought by a philosopher in the Anglo-American tradition. It 
is a sympathetic and comprehensive reconstruction of the philosoph- 
ical foundations that underly Hegel’s views on society, politics, and 
morality. The philosophical foundations investigated here are not 
the speculative metaphysics of Hegel’s Logic (which Wood regards 
as a dead enterprise) but Hegel’s ethical theory: his understanding 
of the good and its relation to reason, freedom, and human happi- 
ness. Wood’s clarity and philosophical expertise make this book a 
highly valuable resource for students of Hegel, both beginning and 
advanced, as well as for analytic philosophers who want a compre- 
hensible account of Hegel’s ethical thought which is free of ob- 
fuscating jargon. Wood successfully demonstrates both that Hegel 
has an ethical theory and that it is a powerful view worthy of our 
attention. 

Wood classifies Hegel’s normative ethical theory as a “‘self-actuali- 
zation theory,” which he distinguishes from both deontological and 
teleological positions. An ethical theory based on self-actualization 
regards as good those actions and ends which are espoused by ra- 
tional subjects attempting to realize their conceptions of their own 
essential nature. In Hegel’s theory the subject’s essence that is to be 
actualized is freedom or, more accurately, freedom in its various 
configurations. The latter modification is necessary, since freedom 
for Hegel assumes a variety of forms, and since the ordering of these 
forms plays a crucial role in determining the architectonic of his 
theory. The principles of the three main parts of Hegel’s theory— 
abstract right, morality, and ethical life (Sittl:chkett}—are derived 
from three distinct (but complementary) ways in which subjects of 
the modern world understand themselves as free. Because Hegel’s 
position is grounded in a single fundamental good (freedom), it 
comes close to being a teleological theory that posits freedom as the 
telos of ethical action. Yet Wood rejects the teleological label be- 
cause it mistakenly suggests that ethical action can be understood as 
a means to some end external to the action. On Wood’s reading of 
Hegel, actions have ethical value not because they are instrumental 


* I would hke to thank John Rawls for his comments on an earlier version of 
this review 
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to the achievement of some independent good, but because acting 
through the forms of freedom is the way in which a subject objecti- 
fies, or gives reality to, its conception of itself as free. (Owning 
property, for example, has ethical significance not because it has an 
independently valued end as its consequence, but because the will’s 
relationship to its property satisfies a conception the modern sub- 
ject has of itself as a free chooser of ends.) Who, then, is the subject 
of self-actualization: Spirit or human individuals? To this question 
Wood replies that it is both, and that Hegel’s emphasis on the notion 
of Spirit does not diminish the importance of individuals in his 
theory, since the self-actualization of Spirit occurs only through the 
self-actualization of individuals. We should wonder, however, 
whether this fact alone warrants Wood’s conclusion that “‘Hegel’s 
ethical theory is after all founded on a conception of individual 
human beings and their self-actualization’”’ (21, emphasis added). I 
return to this issue below in my discussion of Hegel’s relation to 
liberalism. Although Wood’s book contains many ideas worthy of 
careful attention (including an excellent treatment of freedom, hap- 
piness, and their relationship), I shall restrict myself to commenting 
on two problems raised by his interpretation. 
THE EMPTINESS OF THE CATEGORICAL IMPERATIVE 

The best known of Hegel’s ethical doctrines is his charge that the 
basic principle of Kantian morality, the categorical imperative (CI), 
is empty and therefore unable to serve as a criterion for what is 
morally right and wrong. This view has received a good deal of 
attention from commentators, most likely because there is some- 
thing intuitively compelling in the claim that the formal nature of 
Kant’s pure practical reason makes it unable to supply the notion of 
moral duty with determinate content. Yet even sympathetic inter- 
preters have had a difficult time making sense of this critique, for 
the specific arguments Hegel marshalls in support of it (in his fa- 
mous examples of aiding the poor and returning the deposit of the 
deceased) seem to rest upon a gross misunderstanding of Kant’s 
procedure for testing an agent’s maxims. Wood makes a provocative 
suggestion as to how Hegel’s critique must be understood if we are 
to be able to take it seriously. His interpretive strategy is to distin- 
guish between two versions of the emptiness charge, the first of 
which is said to fail as a criticism of Kant, whereas the second alleg- 
edly succeeds in pointing out an important defect in Kant’s moral 
philosophy. 

The first version of the emptiness charge, the one most commonly 
attributed to Hegel, claims that the CI (especially in its first formu- 
lation in the Groundwork) cannot distinguish between permissible 
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and impermissible maxims: from one perspective all maxims pass the 
test of universalizability; from another all maxims fail. Wood claims 
not only that Hegel’s particular examples fail to support his charge 
but that the critique itself is untenable. By reconstructing Kant’s 
example of the unreturned deposit, Wood argues that the CI proce- 
dure, properly understood, is able to rule out some maxims as im- 
permissible. Although the procedure does run into serious problems 
(157, 160), emptiness in the sense alleged by Hegel is not one of 
them. The second version of the emptiness charge, which Wood 
defends as a cogent critique, is directed not at Kant’s formulations 
of the CI but at something more fundamental: the philosophical 
position from which Kant attempts to derive and defend this basic 
moral principle. On this reading, Hegel’s claim is that no contentful 
ethical principles can be derived from the abstract notions of reason 
and subjectivity which ground Kant’s moral philosophy. Wood 
fleshes out this claim in an attack on Kant’s attempt to arrive at a 
formulation of the supreme principle of morality starting merely 
from the concept of an objective practical law, or a categorical im- 
perative. Wood’s critique of Kant’s derivation of the CI is plausible 
and makes good philosophical sense of Hegel’s charge of emptiness. 
Moreover, his interpretation suggests a way of understanding 
Hegel’s well-known examples which does not attribute to Hegel a 
simplistic distortion of Kant’s position. Their point is not to demon- 
strate the emptiness of the CI as it is in fact formulated by Kant but 
rather to show the emptiness of the only criterion (logical noncon- 
tradictoriness) available to Kant, given the conception of reason 
from which he intends to derive his basic moral principle. 

This interpretation raises an important question: How does 
Hegel’s own position enable him to avoid the problem of emptiness 
that plagues Kant? Hegel’s solution presumably lies in his distinctive 
notion of ethical life and its conception of subjectivity, which de- 
fines agents in terms of the roles they occupy in the basic institutions 
of a rational society. Although Wood offers a number of helpful 
suggestions as to how the perspective of ethical life is supposed to 
supply morality with content, two important unclarities remain in 
his account of Hegel’s mature conception of morality and its rela- 
tionship to a theory of social institutions: First, what precisely are 
the resources that Hegel’s own moral standpoint has at its disposal? 
Does Hegelian morality simply do away with the kind of practical 
reasoning involved in Kant’s CI procedure, or does it retain some- 
thing like the universalizability criterion which, when coupled with a 
commitment to the rational social institutions of ethical life, be- 
comes capable of discriminating between permissible and impermis- 
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sible maxims? Second, how are we to envision the division of labor 
by means of which the principles of morality collaborate with those 
of ethical life to yield a determinate set of duties? Is it the case, as 
Wood sometimes suggests (173, 210-1), that the standpoint of mo- 
rality gives rise itself to a set of moral duties (e.g., acts of charity to 
strangers), which are supplemented by ethical life in the sense that 
the latter provides an additional set of distinctively ethical duties 
(e.g., family obligations, respect for civil law)? On this picture, moral- 
ity is incomplete without the institutions of ethical life but not, 
strictly speaking, empty. Or is it rather (136, 172) that the moral 
standpoint is indeed empty in the sense that the categories it em- 
ploys (the “rights” and “well-being” of others) lack all content when 
considered in abstraction from ethical institutions? This reading 
(which seems to accord best with §§135R and 148R of the Philo- 
sophy of Right (PhR)) holds that morality on its own “provides us 
with no determinate criteria of action” and that the latter can be 
“supplied only by the more concrete social relationships of ethical 
life” (136). And, if the latter is indeed Hegel’s view, can one defend 
his thesis concerning the emptiness of the purely moral standpoint, 
given Wood’s argument in response to the first version of the empti- 
ness charge that Kant’s CI can provide contentful criteria for right 


action? 
HEGEL AND LIBERALISM 


One topic that runs throughout this book is Hegel’s relationship to 
liberalism. Wood provides a valuable corrective to current interpre- 
tations by emphasizing those aspects of Hegel’s thought which link 
him to the liberal and Enlightenment traditions and distance him 
from contemporary communitarianism. On Wood’s interpretation, 
Hegel espouses political institutions that are “pretty much” those 
defended by liberals but rejects the philosophical grounds on which 
liberals justify those institutions. Although Wood’s treatment of 
Hegel’s relationship to liberal theory is generally illuminating, I shall 
focus on one aspect of his discussion that seems to me problematic: 
the question of whether, and in what sense, a collective good (here 
the good of the state) is reducible to the good of the individuals who 
compose it. According to Wood, Hegel subscribes (along with lib- 
erals) to “the virtual tautology that collective goods have value be- 
cause they have value for individuals” (259). Hegel is said to di- 
verge from liberal theories in asserting that ‘‘collective goods have a 
value in themselves not reducible to the private good of individ- 
uals” (258, emphasis added). Presumably, Wood is pointing here to 
Hegel’s idea that the value of the state resides in something beyond 
its function of enabling citizens to attain ends they possess as private 
individuals in abstraction from their political relations to others. 
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Participation in the political sphere is more than merely a means for 
achieving private goods; as the activity through which citizens deter- 
mine their own social existence, it is valuable in itself. But this aspect 
of Hegel’s position does not in fact distinguish him from much of 
the liberal tradition. To cite just two examples, neither Rousseau 
nor John Rawls would deny that the value of the political realm 
extends beyond its instrumental value for the attainment of private 
ends. If there is a difference here, I believe it is more accurately 
located in Hegel’s insistence that individuals achieve their highest 
good (as practical beings) in political activity. 

An even more fundamental difference comes to light if we recon- 
sider Wood’s claim that for Hegel collective goods have value be- 
cause they have value for individuals. It is certainly Hegel’s view that 
a necessary condition of a good’s having collective value is that it be 
valuable for individuals. But it does not follow from this that a col- 
lective good has value only because of its value for individuals. This 
point is significant, because it casts doubt on Wood’s conclusion 
that Hegel’s ethical theory is ultimately founded on the notion of 
individual self-actualization. Indeed, this seems to me a misleading 
characterization of Hegel’s view, for his conception of individual 
self-actualization is derivative of his understanding of the self-actu- 
alization of Spirit. Individual self-actualization is an important no- 
tion for Hegel, but it is understood in such a way that individuals 
realize their essence not as individuals but as members of a collec- 
tive, i.e., as vehicles of Spirit. Individuals achieve their highest free- 
dom by participating in and identifying with a social whole that is 
itself a complete and self-sufficient good. Furthermore, Hegel's or- 
gantc conception of the social whole and the collective good re- 
quires that there be a substantial differentiation, even an ethical 
hierarchy, among members of the state which violates the basic lib- 
eral view of citizens as morally equal individuals (PhR, §308R). The 
peculiar nature of Hegel’s understanding of individual self-actualiza- 
tion is brought out best in his treatment of social groups that are 
denied full participation as citizens. The peasant, for example, occu- 
pies a role in civil society that is incompatible with the full develop- 
ment of the human capacities for rationality and individual self-de- 
termination. Yet the individual peasant realizes his essence as free in 
the rational state. The crucial point is that he does so, not because 
he achieves autonomy’as an individual, but because he is an integral 
part of a community that, as a whole, is self-sufficient and therefore 
undetermined by anything other than itself. ; 
FREDERICK NEUHOUSER 
Harvard University 
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Realism in Mathematics. PENELOPE MADDY. New York: Oxford 
University Press, 1990. ix, 204 p. Cloth $39.95.* 


Maddy defends a form of realism intended to be true to mathemati- 
cal practice against this objection: on a realist account, mathematics 
would be about abstract objects. Since we cannot interact with ab- 
stract objects, mathematical knowledge would then be impossible. 
She seeks to explain how mathematicians ordinarily obtain mathe- 
matical knowledge in a way that is consonant with and makes use of 
the best contemporary scientific theory of whatever turns out to be 
relevant. 

Some readers may be puzzled at this point. What Maddy calls the 
Quine/Putnam indispensability argument explains the possibility of 
mathematical knowledge: mathematical objects are indispensable to 
our best theory of the natural world, and that theory yields mathe- 
matical knowledge. But mathematicians do not normally make use 
of extramathematical scientific theories to obtain mathematical 
knowledge. The Quine/Putnam indispensability argument does not 
explain how mathematical knowledge is actually obtained. 

The obvious method of obtaining mathematical knowledge— 
proving theorems—is no help: a proof shows that if the axioms are 
true then so is a theorem. But how do we know that the axioms are 
true?! Mathematicians seem to use a faculty of mathematical intu- 
ition that plays a role for mathematics analogous to that sense per- 
ception plays for physics. Maddy’s problem is (at least in part) that of 
giving a naturalistic explanation both of how mathematical intuition 
is possible and of the role it plays in mathematical knowledge. 

We have been discussing mathematics generally, but, “to bring 
the project down to more manageable size,” Maddy only presents 
her solution for one branch of mathematics—-set theory. Set theory 
poses a particularly interesting challenge for any theory of how we 
come to accept axioms, since its practitioners actively seek out new 
axioms. 

Maddy’s solution to the problem of mathematical intuition is sim- 
ple: we see some small simple sets in the same kind of way that we 
see some medium-sized physical objects. The mathematician’s intu- 
itions about sets are essentially identical in kind to the natural scien- 
tist’s intuitions about physical objects: both are products of the ap- 


* I would like to thank Bernard Berofsky and Sidney Morgenbesser for useful 
discussions. 

1! There is a long tradition of denymg the importance or even the coherence of 
this question, and a long tradition of arguing that that denial is untenable. Maddy 
reviews the situation and concludes that the denial is untenable. 
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paratus we use to perceive the sets and physical objects. The rele- 
vant sense of “seeing” is that in which seeing a medium-sized 
physical object enables the person who sees it to gain appropriate 
perceptual beliefs about it. The work of Piaget and others has taught 
us that the ability to see a medium-sized physical object is the result 
of a long and complex developmental process. According to Hebb’s 
neurophysiological theory, this developmental process involves suc- 
cessive alterations in neural structuring in the brain which culmi- 
nate in the acquisition of a complex cell assembly that serves as a 
physical-object detector. To be capable of seeing a physical object is 
just to have such a physical-object detector. Our neuropsychological 
theories will surely change, but there is reason to believe that what- 
ever form our account of vision comes to take will allow Maddy’s 
arguments to go through without essential modification. Maddy 
tries to show that one can give an account of how we perceive sets 
similar to the one for physical objects. f 

Take Maddy’s example of Steve’s opening a carton of eggs and 
seeing that the set of eggs in it has three members. By analogy to 
perception of physical objects, she must show that there is such a 
set, that Steve acquired perceptual beliefs about it, and that the set 
participated in the acquisition of those beliefs in the right way. The 
belief that there are three eggs is perceptual, not inferential, as is 
indicated, for example, by reaction-time studies. But why should we 
take the belief to be about a set? As Frege taught us, the stuff that 
makes up the three eggs also makes up some huge number of mole- 
cules. Whatever three is a property of, it is not a property of the 
underlying matter. The usual candidates include aggregates, con- 
cepts, classes, and sets. Quine/Putnamism enters in: sets play a role 
in our best theory of the world, while the other entities named do 
not. Thus, according to Maddy, Steve has acquired a perceptual 
belief about a set. But has he seen a set? 

The remaining point is that the set of eggs has participated in the 
acquisition of Steve’s beliefs in the right way. Maddy adopts Grice’s 
method of finessing the problem of what “the right way” is: “the 
thing perceived must play the same sort of role in the causation of 
the perceiver’s perceptual state as my hand plays in the generation 
of my belief that there is a hand before me when I look at it in good 
light.” The hand plays the following role: light that has interacted 
with an appropriate part of it is focused on the retina, and that 
causes appropriate cell assemblies to be stimulated in an appro- 
priate sequence. To see a set, Steve must have appropriate cell as- 
semblies that developed in the same sort of way as did the ones that 
play a role in seeing a hand in front of your face. Maddy tells a 
satisfactory story involving Piaget on sets and a Hebbian neural 
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“set-detector.” In addition, light that has interacted with an appro- 
priate part of the set must have been focused on Steve’s retina. But 
the eggs, which are what the light interacted with, are not parts of 
the set. Thus, despite Maddy’s arguments, Steve did not see the set. 
Here is another example: according to Maddy, “the most natural 
way of understanding” the perceptual belief “here are three pairs of 
shoes” involves a set of sets—the set whose members are three sets, 
each of which contains two shoes as members. Even though the 
shoes are members of members of the set, they are not parts of the 
set. There is a more direct argument to the same conclusion: sets are 
abstract objects, and so they cannot have causal interactions. 

Maddy established that Steve’s belief that there are three eggs is a 
perceptual belief about a set. Thus, since the last criterion for per- 
ception of a set fails, what we conclude is that Steve acquired a 
perceptual belief about the set without having perceived it. That is 
not paradoxical: when you pass a streetlight at night and see the 
shadow of someone carrying a knife, you acquire a perceptual belief 
that there is a mugger behind you, but you have not seen the mug- 
ger. If we acquire perceptual beliefs about sets, that suffices for 
Maddy’s account of mathematical intuition. Since Maddy maintains 
in addition that we see sets, she grants that we can causally interact 
with them, and hence that sets are not abstract. The amended ac- 
count avoids the need to grant that. One cannot perceive the empty 
set, and so Maddy owes us an account of why perceptual beliefs that 
a set has zero members are to be analyzed differently from percep- 
tual beliefs that a set has one or more members. On the amended 
account, the empty set can be handled in the same way that other 
small sets can. 

Maddy only exploits her argument to account for mathematical 
intuition, but it is likely that parallel arguments could be used to 
maintain that we perceive a wide variety of things ranging from 
statements to melodies to causes. Her argument can be fruitfully 
recast in a contrapositive form: skepticism about the possibility of 
giving a naturalistic account of seeing sets leads to parallel skepti- 
cism about the possibility of giving a naturalistic account of seeing 
medium-sized physical objects. 

In the process of acquiring the concepts of set and physical ob- 
ject, those concepts become structured in such a way that some 
beliefs are inevitable while others are impossible. Those intuitive 
constraints on belief arise with the concepts themselves. The combi- 
nation of Piaget and Hebb shows how the process might be natural- 
istically realized. Since those intuitions are obtained in virtue of the 
right kinds of interactions, we have, Maddy claims, some reason to 
believe they will be reliable. 
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Has Maddy succeeded in her task of defending “the mathematic- 
ian’s pre-philosophical attitude”? We must consider set theory and 
its history. Maddy gives an eminently readable account of the devel- 
opment of set theory from its inception through the 1980s which is 
carefully focused on philosophically relevant aspects. This part of 
the book is a tour de force of exposition that makes much important 
information accessible. I shall only discuss the axiom of choice here. 

Cantor used the well-ordering principle—every “well-defined” 
set can be well ordered. Zermelo gave a proof (1904)? that used the 
assumption 


that even for an mfinite totality of sets there are always mappings that 
associate with every set one of its elements. .. 


That is Zermelo’s version of the axiom of choice. Modern versions 
of the axiom drop the requirement that the totality be infinite. 
Nonetheless, the whole point is the infinite case: it is easily proved 
that finite totalities have such ‘‘mappings’’—choice functions. Zer- 
melo proposed that the axiom be accepted on the grounds that it is 
“intuitively evident and necessary for science ...’’ (1908a). Baire, 
Borel, and Lebesgue opposed the axiom, arguing that it was intu- 
itively implausible. The clash of intuitions prima facie casts doubt on 
the reliability of mathematical intuition. As Maddy argues in detail, 
however, the apparent unreliability of intuition in this case can be 
explained away as the result of mixing up reliable intuitions about 
two distinct types of entities—mathematical and logical collections. 
Intuition reliably indicates that the axiom of choice is true of mathe- 
matical collections, and mathematical developments show this to be 
the more useful notion of collection on which to focus. This account 
of the axiom of choice seems to me to be correct, by and large, and 
Maddy has done a real service in presenting it in detail.’ 

Maddy’s history of the axiom of choice shows there are two differ- 
ent notions of collection about which we have differing intuitions. 
On her account of intuition, there must then be two different con- 
cepts of collection acquired as the result of different experiences. 
That conflicts with her account of the acquisition of the set concept, 
which only made room for one. 

It may be possible to give an account of how theoretical consider- 
ations split the intuitive set concept into two. That would not suffice 
for Maddy: she must explain how we have two notions of collection 


* References are to the bibliography of the book under review. 
* Something like it was suggested earlier by R. Bunn, From the Calculus to Set 
Theory 1630-1910, I. Grattan-Guiness, ed. (London: Duckworth, 1980), ch. 6. 
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about which we have differing intuitions, not just differing beliefs, 
and for her intuitions cannot arise as the result of theoretical consid- 
erations. Given Maddy’s set-theoretic realism and her account of the 
origins of intuition, she must presumably show that we can see sets 
(the mathematical notion), and that we can see classes (the logical 
notion), and that we can tell whether we are seeing a set or a class, 
and that our differing experiences of sets and classes are adequate 
to give reliable intuitions for and against the axiom of choice, respec- 
tively. That seems a hopeless task. 

We now consider whether our intuitions concerning mathematical 
collections count in favor of the axiom of choice. History is unequiv- 
ocal: we have intuitions concerning mathematical collections which 
legitimately count in favor of the axiom. But how can that be 
the case? 

We begin with Maddy’s answer to a different question: How can 
experience of a limited number of things lead to legitimate expecta- 
tions concerning anything beyond that number, and about which 
things should it lead to expectations? According to Maddy, it is an 
objective fact that things are carved up into natural kinds. Our expe- 
rience of a limited number of things of the same natural kind leads 
to evidence concerning other things of that kind. Thus, if liquids 
form a natural kind, our experience that liquids we come into con- 
tact with are wet counts in favor of the expectation that liquids 
are wet. 

Maddy takes sets to form a natural kind. That is how she explains 
why we should take the primitive concept of “something that has a 
number property” to be a concept of sets, not something else. It 
also figures in the explanation of how we know that the intuitions 
that arise in virtue of that concept of set are intuitions concern- 
ing sets. 

Our experience that liquids are wet does not legitimately count in 
favor of the expectation that all liquids are wet: the liquids we come 
into contact with are aqueous, and not all liquids are. Our experi- 
ence tells us nothing about whether nonaqueous liquids are wet. 
More exactly, if a chemist knows that not all liquids are aqueous, but 
that all of the liquids in daily experience are aqueous, then that 
chemist cannot legitimately expect that nonaqueous liquids are wet 
on the basis of daily experience. That is even more true if the chem- 
ist has reason to believe that aqueousness is relevant to wetness. 

We return to the axiom of choice. All of the sets in our daily 
experience have choice functions—and that is itself a part of daily 
experience (let us just grant that for the sake of argument). Some- 
thing like that must be what Maddy has in mind when she refers 
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to “the intuitive evidence for choice.” Unfortunately, we are in a 
liquid-aqueous liquid situation. If sets form a natural kind, then so 
do finite sets. All of the sets in our experience are finite. Thus, we 
do not have intuitive evidence that infinite sets have choice func- 
tions. Moreover, finiteness is relevant to having a choice function. 

Zermelo probably fully understood the situation in 1904, when he 
formulated the axiom of choice. In 1908 (1908a) he makes that 
understanding explicit. Zermelo can no more rely on the type of 
intuition described by Maddy than our chemist could rely on daily 
experience, and yet Zermelo says that the axiom of choice is “‘self- 
evident.” And Zermelo is the beginning of a long tradition that is 
dominant today. Maddy cannot deny the self-evidence of the axiom 
of choice on the basis of the controversies that have surrounded it, 
since she has successfully explained those controversies away. She 
must therefore admit that her account of set-theoretic intuition is 
incomplete. Since she wants to give a naturalistic account of intu- 
ition, and “the human mind is finite,” that account will have to 
explain how we can in a warranted manner extrapolate from our 
experience of finite things into the infinite. 

The gap in Maddy’s story should not blind us to her accomplish- 
ments. She has provided the beginnings of a classification of the 
various justificatory practices of mathematicians, in so doing giving 
a philosophical analysis of the relevant history. And she has focused 
on one particularly troublesome justificatory practice of mathema- 
ticians: appeals to intuition. Even if Maddy’s account of mathemati- 
cal intuition is mistaken in some of its details, and even though that 
account is incomplete, it remains a counterexample to the claim that 
a naturalistic account of mathematical intuition is impossible. Her 
work has made it clear that the problem of giving an adequate ac- 
count of mathematical intuition is one of the central problems in the 
philosophy of mathematics. 

SHAUGHAN LAVINE 


Columbia University 


The Realm of Rights. JUDITH JARVIS THOMSON. Cambridge: Har- 
vard University Press, 1990. viii, 383 p. Cloth $35.00. 


Judith Jarvis Thomson is famous, or infamous, for her imaginative 
appeal to test cases in philosophical argument. In The Realm of 
Rights she defends and uses brilliantly her method of connecting 
our, or her, presystematic moral judgments about how someone 
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may or ought to act under specified circumstances with very general 
theses in ethical theory. Her book is a gold mine of important hy- 
potheses about what it is to have a right and what rights we have. 
Any moral philosopher who extracts the true gold and discards the 
few iron pyrites will be much the richer. 

Thomson reconstructs Hohfeld’s ideas of the fundamental legal 
conceptions and uses them in her analysis of our moral rights. Fol- 
lowing him, she takes claims to be fundamental to rights and a claim 
of X against Y to be equivalent to a duty of Y to X. Precisely what 
kind of a behavioral constraint, then, is a duty of Y to X? According 
to Thomson, what is central is (with qualifications) that it is morally 
permissible for Y to infringe X’s claim if and only if Y would thereby 
produce a sufficiently large and appropriately distributed increment 
of good or advantage (175). Additionally, other things being equal, 
Y ought to seek a release from X in advance and Y ought to compen- 
sate X later for such harms or losses as Y causes X by any infringing 
of the claim (200-1). 

This seems correct as far as it goes, for one may not infringe a 
claim merely to produce a marginal increment of value when all 
those affected by one’s act are counted. Still, more needs to be said. 
How can one explain the relational aspect of X's claim against Y 
consisting in Y’s duty to X? The only hint I can find is when Thom- 
son suggests that a legal duty might consist in the legal system’s 
attaching a penalty to “Y’s doing such and such to X” (71). Now, 
possibly a relative duty is owed to the patient, the party who would 
be acted on in any violation of that duty. Hohfeld, however, suggests 
that a relative legal duty is owed to either the party benefited by its 
performance or the party with the power to take legal action in the 
event of nonperformance. Since these three interpretations often 
diverge in practice, for example in cases of a third-party beneficiary 
under a contract, any adequate analysis of a claim must decide be- 
tween them or find another explanation of the relational aspect of a 
claim. Moreover, when one is told that ‘‘a claim just is a constraint 
on behavior” (202), one would like some clear account of the nature 
of a moral constraint. In a book on moral rights, Thomson need not 
decide whether the constraint of legal duties consists merely in the 
attached penalty for violation, but she can hardly ignore the ques- 
tion of whether moral constraint consists entirely or in part in moral 
sanctions, especially when she concludes that “there do not appear 
to be any simple, nontrivial theses connecting claims with the per- 
missibility of enforcing them” (116). 

If Thomson’s analysis of a claim were adequate, would it, as she 
alleges (200), suffice to explain what all rights are? I believe not. 
Even supposing that every power were some sort of ability to extin- 
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guish or generate a claim, one needs to explain what distinguishes a 
specifically moral ability from a nonmoral one. I have the ability by 
persistent pleading to cause my wife to promise to prepare my fa- 
vorite dish for dinner, but it is she, and not I, who has the moral 
power to confer upon me any claim that she do so. What is needed 
here is some moral analogue of Hohfeld’s legally operative facts, but 
how might one explain how certain facts could be morally operative? 

And even if every part of a cluster right were definable in terms of 
claims, there remain problems in explaining the unity of that right. 


Many of the most familiar rights . . . are clusters of rights, picked out 
and marked off by mention of a positive value, the rights in the cluster 
being those that in one or another way ‘protect’ the value (285). 


This is plausible in the case of the right to liberty, but does the value 
of liberty define the content of this right? This seems to imply that 
one’s right does not extend to valueless liberties, much less liberties 
that it would be harmful to exercise on this occasion. And is a trivial 
value sufficient to unify a cluster right? Thomson suggests that the 
value of the green cake with the walnut on top is what unifies her 
right to that cake (287-8). But surely it cannot be its value to her 
which confers on her any right to it. Perhaps she made it herself or 
the hostess promised it to her. If so, it would be the value of making 
or promise keeping which is central here, not the value of that tiny 
cake to which she has a cluster right. Moreover, precisely how do the 
Hohfeldian elements protect the unifying value? Thomson acknowl- 
edges by her use of quotation marks that something like Neil Mac- 
Cormick’s conception of normative protection is called for here, 
but she offers no account of what specifically moral protection 
might be. 
Thomson asks what feature of us is the source of our natural 
claims. She surveys two alternative possibilities. 
(i) We would have had no claims if we had not been subject to moral 
law, for if we were not subject to moral law, then a fortiori morality 
would not require of us that we accord any claims. ... (i) We would 
have had no claims if we had been subject to moral law, but a moral law 
that requires what is incompatible with ¥’s being, in respect of X, 
under the behavioral constraint that X’s having a claim against Y con- 
sists in (222). 


What is puzzling about both alternatives is that they are primarily 
features of the duty bearer rather than the right holder. This calls 
into question her presumption that it is something about us that 
explains why we have claims. 

She goes on to argue (215-21) that (ii) would have been true of us 
if and only if we had had no interests that were not identical with the 
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interests of the group as a whole. Hence, one source of our natural 
claims is our inherently individual interests. But why imagine that 
the content of the moral law that applies to us is determined by the 
nature of our interests? Thomson seems to presuppose some sort of 
teleological theory of obligation here. One wonders how she can 
also assert “There is more to morality than value: there are also 
claims” (148). No doubt her distinction between value and good for 
a person (150-2) is relevant here. What remains in doubt is how, if 
at all, this distinction can be incorporated into a coherent and plau- 
sible view of moral obligation. 

What basic rights do we have? Apart from our cluster right to 
liberty and various powers and immunities, we have only pure social 
claims arising from someone giving her word or from property law 
and natural claims that others not commit trespass against us or 
cause us harm or non-belief-mediated distress. In Part II of The 
Realm of Rights, Thomson appeals to our presystematic moral 
judgments to show that we do have these rights. She defends her 
thesis that we have no additional claims by examining plausible 
counterinstances and suggesting that each has morally unacceptable 
implications. Although I suspect that her “limits thesis” (273-4) is 
mistaken, her argument for that thesis is entirely appropriate, and 
she is surely correct to conclude that the burden of proof lies with 
anyone who would assert that we have any additional moral claims. 

The Realm of Rights does not pretend to be a complete and 
systematic theory; what it proposes is a way of looking at rights. 


To have a right is to have a kind of moral status, so working out what a 
right is comes to the same as working out what people ought or ought 
not to do, may or may not do, given a person has a right (373). 


Instead of beginning with some normatively neutral definition of “a 
right” and then going on to investigate what rights we have, Thom- 
son appeals to the moral implications of rights throughout. The 
result is a book that is immensely rich in moral insight and illumi- 
nates, as too few philosophical treatises do, what is of practical im- 
portance in rights. But a moral theory contains explanatory theses as 
well as object-level moral judgments (30). On this level, especially 
regarding the sources of our rights, this book leaves many questions 
unanswered. Fortunately, there is hope that Thomson’s view of 
rights can be helpful as she, and we, continue to probe these 
questions. 
CARL WELLMAN 


Washington University 
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NOTES AND NEWS 


The editors regret the untimely death of Professor Gerald Feinberg on 
April 21, at the age of 58. Professor Feinberg obtained his bachelor’s de- 
gree in 1953 and his doctorate in physics in 1957, under T. D. Lee. Both 
degrees were from Columbia. He had been a faculty member of the Colum- 
bia Physics Department since 1958. Professor Feinberg is best known for 
his contributions to the theoretical physics of neutrinos and of interatomic 
forces, and for the theory of tachyons, hypothetical relativistic particles that 
travel faster than light. He was the author of numerous books and articles 
on a wide variety of subjects, including the philosophical foundations of 
physics. His work continued until the very end of his life. His last philosophi- 
cal paper will appear in these pages in the next few months. 


The Planning Committee for the XXth World Congress of Philosophy in- 
vites suggestions for the overall theme of the Congress, which will be held in 
Boston on August 10-16, 1998. The theme must be specified in the Plan- 
ning Committee’s formal proposal to host the World Congress, which will 
be considered by the General Assembly at the XIXth World Congress in 
Moscow in August 1993. It should be a topic of broad interest to philoso- 
phers in the United States and abroad. Suggestions should be sent to the 
Chair of the Planning Committee, Richard De George, Phil. Dept., Univ. of 
Kansas, Lawrence, KS 66045. Suggestions regarding potential sources of 
financial support from governmental, foundation, and corporate sources 
are also invited. 


Professor and the late Mrs. Colin Turbayne established an annual Berkeley 
Essay Prize competition in conjunction with the Philosophy Department at 
the University of Rochester. They are happy to announce that the prize 
winner in the third year is Lisa Dowling from Yale University. The next 
deadline for submitting papers is November 1. Submissions on any aspect 
of Berkeley's philosophy are welcome. Essays should be unpublished, writ- 
ten in English, and less than 5,000 words. Submissions should be sent to 
Richard Feldman, Chair, Phil. Dept , Univ. of Rochester, Rochester, NY 
14627-0078. 


The Editorial Board of The Philosophical Quarterly would like to an- 
nounce the topic of its 1992 essay competition: Scholasticism: Old and 
New. Contributions may be on any topic falling within this general theme; 
especially welcome, however, will be papers which explore connections be- 
tween issues drawn from different historical periods, and in which areas of 
current philosophical mterest are discussed. The final date for submission is 
November 1. Essays should not be longer than 6,000 words and should 
conform to.the usual stylistic requirements. Two copies of each essay are 
required, all entries will be regarded as submissions for publication in The 
Philosophical Quarterly, and both winning and nonwinning entries judged 
to be of sufficient quality will be published. For further information, you 
may write to Executive Editor, The Philosophical Quarterly, -Univ. of St. 
Andrews, Scotland KY16 9AL. 
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CAESAR’S WIFE: ON THE MORAL SIGNIFICANCE OF 
APPEARING GOOD* 


UETONIUS and Plutarch both report that Caesar divorced 
his wife Pompeia because she was suspected of adultery. In 
the story related by them, a man named Clodius sneaked into 
Caesar’s house dressed as a woman in order to rendezvous with 
Caesar’s wife, apparently with her complicity. Clodius was appre- 
bended, and Pompeia fell under suspicion. It is not clear that she 
was actually involved with Clodius. In any case, actual guilt or inno- 
cence was irrelevant to Caesar. Caesar’s wife must be above suspi- 
cion. Since her behavior resembled guilty behavior, she is blamed.’ 
I shall explore the question of whether an action that is not im- 
moral in virtue of any independent reasons can be bad, or immoral, 
in virtue of resembling an immoral action. The answer will be “yes.” 
What is surprising about this answer is that it means that, in an 
important class of these cases, the immorality of some actions will 
depend completely on the mistaken beliefs of other people. 
1 THE MORAL SIGNIFICANCE OF RESEMBLANCE 
That an action’s resemblance to an immoral act may be morally 
significant has been noted before, but with little accompanying anal- 
ysis, Kant, for example, relies on a resemblance criterion in his Lec- 
tures on Ethics? when discussing duties we have toward animals: 


* This paper has benefited from conversations with Jonathan Adler, Sarah 
Buss, John Dick Moran, Connie Rosati, Roy Sorensen, and William Wil 
hams; from the Philosophy and Public Affairs Society of New York in February 
1991, Susan Brison, Gertrude Ezorsky, and Frances Kamm; and from 
the 199 Pacific Division Meeting of the American Philosophical Association, 
especially Hugh Lehman and Terrance McConnell 

See Plutarch, The Lives of the Noble Grectans and Romans, John 
and Arthur Hayden Cl trans. (New York: Random, 1864), pp. 859-60. 
* Louis Infield, trans i ls Hackett, 1963). 
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since animals are similar to human beings, cruelty toward animals 
will undermine genuine duties toward other persons. Thus, one 
ought to refrain from cruelty toward animals in pursuing one’s duty 
toward other persons. 


Animal nature has analogies to human nature, and by doing our duties 
to animals in respect to manifestations which correspond to manifesta- 
tions of human nature, we indirectly do our duty towards humanity. 
Thus, if a dog has served his master long and faithfully, his service, on 
the analogy of human service, deserves reward ... (ibid., p. 239). 


If a person behaves viciously toward an animal he is in danger of 
corrupting himself by failing to reinforce his benevolent tendencies 
—tendencies that should be cultivated to support the sense of duty 
toward humanity. Since animals resemble humans, and animal be- 
havior resembles human behavior, cruelty toward animals may lead 
people to assimilate this with cruelty toward humans. 

Jane English,’ after describing a Kantian account of why it is 
wrong to mistreat animals, extends the analysis by citing resem- 
blance as a factor in deciding on the permissibility of abortion. 


... the similarity of a fetus to a baby is very significant. A fetus one 
week before birth is so much like a new born baby in our psychological 
space that we cannot allow any cavalier treatment of the former while 
expecting full sympathy and nutritive support for the latter (ibid., p. 
175). ' 


English further suggests that resemblance explains why we show 
respect for dead bodies, even though they are not persons. 

No doubt, resemblance exerts a powerful influence over human 
psychology. According to Hume,* it is resemblance between people 
which helps to explain the force of sympathy: 


... this resemblance must very much contribute to make us enter into 
the sentiments of others and embrace them with facility and pleasure. 
Accordingly we find, that where, beside the general resemblance of our 
natures there is any peculiar similarity in our manners, or character, or 
country, or language, it facilitates the sympathy (ibid., p. 318). 


Indeed, it is no accident that during times of war the enemy is often 
portrayed as being very different from us, since this helps to cut 
them off from any sympathy we would feel. 


3 “Abortion and the of a Person,” in Morality in Practice, James 
Sterba, ed. (Belmont, CA: W rth, 1988). 

“A Treatise of Human Nature, L. A. Selby-Bigge, ed. (New York: Oxford. 

967). 
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This discussion shows that, in addition to leading to self- 
corruption, resemblance is also relevant because of its effect on 
others. Someone witnessing a late abortion may grow callous toward 
newborns. Also, people are liable to make mistakes about what it is 
that they are seeing. If someone witnesses an action that she thinks 
is directly immoral, then the witness’s morals could be corrupted. A 
person who witnesses the desecration of some bodies, for example, 
may become callous and brutal. 

Broadly speaking, then, there are two ways in which resemblance 
can be morally significant. 

(1) First of all, resemblance causes us to assimilate, psychologi- 
cally, different types of actions. Because of our “coherence of atti- 
tudes,” we need to be sensitive to resemblance so that we do not 
corrupt ourselves or others.” Psychologically, we are so constructed 
as to be influenced by resemblance in the making of our moral 
judgments. English argues that our attitudes and sympathies, which 
reinforce our moral principles, need to form a coherent set, thus: 
“If our moral rules allowed people to treat some person-like non- 
persons in ways we do not want people to be treated, this would 
undermine the system of sympathies and attitudes which makes the 
ethical system work” (ibid.). If one performs actions that look im- 
moral, one may develop a taste for doing the real thing, or a truly 
unpleasant disposition. So, resemblance can directly cause harm by 
breaking down resistance to certain actions. By analogy, our sympa- 
thies are dampened and our attitudes may shift in a bad direction. A 
person who sees a 7-month fetus thrown into a garbage pail and an 
8-month premature infant in intensive care may become brutalized 
to the suffering of these infants because the resemblance between 
fetus and infant will cause her feelings to change. Likewise, some 
actions will be good in virtue of resembling independently good 
actions. Not only should we refrain from hurting animals, but maybe 
we should even be nice to them to encourage similar behavior to- 
ward humans. Kant’s faithful dog should be rewarded, so that 
agents also reward faithful persons. 

(2) Second, resemblance can lead people to make mistakes—to 
misconstrue an action’s moral character.’ A person who wears fake 


® See Enghsh, pp. 174-5 for a brief discussion of this. 

aif ude notes (p61) that reeeniblancevts a great raora o miteapersondl ena 
because it leads to mistakes in reasoning: ‘‘there are few mistakes in reasoni 
which do not borrow largely from that ongin. Resembling ideas are not o 
related together, but the actions of the mind, which we employ in considering 
them, are so little different, that we are not able to dis ish them. This last 
circumstance is of great consequence; and we may in gen observe, that when- 
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fur could be mistaken for a person who wears real fur. If an action 
resembles an immoral action, then it is not surprising that it could 
be misunderstood as independently immoral in some contexts. Mis- 
construal, in turn, has its own corrupting influences. Such miscon- 
strual can corrupt the morals of others, in addition to causing other 
harms. This is the most interesting claim regarding the moral signifi- 
cance of resemblance, because it means that the immorality of some 
actions depends on the agent realizing that their true nature will be 
misunderstood. 

The crucial difference between the two ways in which resem- 
blance is significant is that the second involves a misconstrual of the 
action. In the first case, the person performing the action or wit- 
nessing it may know full well the difference between, for example, 
the dead person and the live person. It is just that the resemblance 
between them is forceful enough that perhaps the abuse of the 
former would lead to decreased respect for the latter. In the second 
class of cases, however, the witness to the action mistakes it for some 
other action. These mistakes or misconstruals can lead to unfortu- 
nate consequences in one of two ways. First, a person can witness an 
action and mistake it for another action that is immoral. The witness 
does not believe that the immoral act is immoral, and this leads to 
the witness’ doing or condoning that type of action. For example, in 
the Caesar’s wife case, the witness thinks that Pompeia has com- 
mited adultery when in fact she has not, and comes to the conclu- 
sion that adultery must be all right after all. Second, a person can’ 
witness an action and mistake it for another action that he does 
believe to be immoral. Still, witnessing the supposedly immoral act 
causes there to be less pressure on him to avoid those immoral acts. 
Thus he will go ahead and do them. Thus the witness might see 
Pompeia leave the door open for Clodius at midnight, think that she 
is planning an adulterous liason when in fact she is not, but, know- 
ing that adultery is immoral, do it anyway. If she can get away with it, 
I can too. This is the more plausible sort of scenario in the Caesar’s 
wife case. 

Also, misconstrual could lead to good consequences, if an action 
is mistaken for a good one because of its resemblance to an indepen- 
dently good action. Consider George Bush donating blood. This 
donation of blood looks like a good action because donating blood 





ever the actions of the mind in forming any two ideas are the same or resembling 
we are very apt to confound these ideas, and take the one for the other.” The 
issue of concern ın this paper is that of interpersonal error that is the result of 
resemblance. 
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helps others. There is a sense in which George Bush’s donating 
blood is a waste, however. Given all the secret service people who 
need to guard him to and from the bloodmobile, his donation will 
cost the tax payers a lot of money. His action, on scrutiny, is dissimi- 
lar to the ordinary citizen’s. Still, it resembles an ordinary person’s 
sacrifice, and is easily mistaken for such, and thus it has great sym- 
bolic value. This symbolic value can be cashed in in the hard utilitar- 
ian currency of consequences. It will motivate others to give blood 
—those who can give blood efficiently. 

These observations give us a place to start in formulating a princi- 
ple of action. 


(CP) An action is itself immoral (moral) if it resembles an indepen- 
dently immoral (moral) act. 


An independently immoral act is an act that is immoral for reasons 
other than resemblance to something immoral. Actions immoral due 
to their resemblance to independently immoral actions I call mi- 
metic immoralities. Both mimetically immoral actions and indepen- 
dently immoral ones are immoral, but they are different descrip- 
tively, and also in terms of the degree of condemnation typically 
associated with them. 

(CP) is false. It is open to fairly obvious objections and counterex- 
amples. First of all, a slippery slope is easily generated. It is immoral 
to drive 100 mph in a school zone. And if it is immoral to drive n 
mph in a school zone, it is also immoral to drive n—1 mph in a 
school zone (since the speeds are nearly indistinguishable). Thus, it 
is immoral to drive 10 mph in a school zone. The second step of this 
induction would be supported by (CP) as it stands at this point. 
Further, some actions that resemble the genuinely immoral are per- 
missible and even admirable. An actress may innocently perform a 
pretend murder; lifeguards are applauded for administering mouth- 
to-mouth recusitation. Nevertheless, (CP) has a grain of truth. As 
above cases illustrate, there is something about resemblance which is 
morally significant. One of my goals here is to articulate that sig- 
nificance. 

O AVOIDING MISCONSTRUAL 
Most of us at some time or other are faced with the choice of 
whether to do something that could be misconstrued as immoral by 
others. A person who believes that killing animals for fur is immoral, 
and thus will not wear a real fur coat, may still feel uneasy about 
wearing a good imitation; or a person who runs away from the scene 
of a crime because he needs to make it to the bank on time may 
worry that his action could be misconstrued as the action of a crimi- 
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nal; or the feminist who goes into a strip-tease joint just to use the 
toilet might have a similar worry that his action could be miscon- 
strued. How can we best explain the moral unease?” 

Intuitions seem inconveniently split: on the one hand, one is 
tempted to say: “Who cares about what others think? I’m going to 
do what I think is appropriate, what J know to be perfectly okay.” 
This expresses moral individualism, a strain of thinking that runs 
through our common-sense ethics. It pictures moral obligations as 
autonomous relations, for example. Moral individualism can be ar- 
gued for a number of ways. One approach is the idea that moral 
obligations can only be voluntarily acquired, that we have ultimate 
control over the moral aspects of our lives. If we accept the moral 
relevance of appearances, then morality falls into the hands of the 
illogical and the ill-informed. What is unpleasant here is that, if one 
is prohibited morally from performing such actions, another bit of 
one’s autonomy is lost. There is nothing at all intrinsically wrong 
with these actions, yet, sensitive to consequences, the agent must 
refrain from doing them. And the consequences are brought about 
because others misunderstand the agent’s action. That is particu- 
larly galling. Why should the agent be responsible for that? 

The moral individualist need not shirk responsibilities. Indeed, he 
frequently stresses that an agent is responsible for his own actions. 
He maintains, however, that how others merely perceive actions is 
irrelevant. This theme breaks to the surface in some of Bernard 
Williams’s® writing on integrity. Williams complains that utilitarian- 
ism, for example, fails to give adequate weight to the concerns and 
projects of the individual. In his critique of utilitarianism he has 
pointed out (in this vein) that, if utilitarianism is correct, then some 
actions that are prima facie wrong are actually right; and that this is 
psychologically too jarring, and demands too much of the moral 
agent. For example, the utilitarian would hold that, if killing another 
human being would clearly maximize good consequences, then the 
agent should kill. Otherwise, she is responsible for whatever bad 
consequences are the result of the failure to act. Thus, an action like 
killing, which seems terribly wrong, could be required—on the utili- 
tarian view, it could turn out to be the right thing to do. Thus, the 


7 Some might maintain that the unease is caused by a fear that one’s reputation 
will suffer from the misconstrual While I think this is a genuine concern—no one 
wants to be Mougn of as immoral—it does not exhaustively explain the unease. 
One is worried there is something wrong in the action, even though one’s 
motives may be pure. 

3 “A Critique of Uuhtarianism,” in Utilitanantsm: For and Against, J. J. C. 
Smart and Willams (New York: Cambridge, 1973), pp. 77ff. 
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agent may be required to do something that goes against his most 
deeply held convictions and commitments. This, Williams argues, 
violates the agent’s integrity. 

The other side of that coin is that some actions that are prima 
facie permissible, or even right, are really wrong—e.g., mimetically 
immoral acts. A utilitarian would require of the moral agent that he 
not wear the imitation fur, if the action is open to misconstrual and 
would corrupt or deeply hurt others. 

The moral individualist will not appreciate having her actions 
constrained by the false beliefs of others. This is because it entails 
that there is some aspect of her moral life over which she has very 
little control. It makes the quality of her actions depend on the 
interpretation of others. On the other hand, in contrast to moral 
individualism, one does not want to give a false impression, espe- 
cially when it could influence other people. If one’s behavior is 
influential, then one should not even give the appearance of im- 
proper behavior.’ 

One explanation for the muddled intuitions is that we often seem 
to buy into a moral safety argument in trying to figure out how we 
ought, or ought not, act. If I cannot decide whether an act is permis- 
sible or not, I shall play it safe (morally) and refrain from doing it. 
For example, if a person feels that killing animals is wrong because 
they are sentient, she may be uncertain about where to draw the 
line. Is it all right to eat shell fish? To play it safe, she may just refuse 
to eat any animal at all. 

In Amish societies, this tendency to “play it safe” has often led to 
splits in the group. Suppose that there is a prohibition against wear- 
ing more than three buttons on one’s clothing at any given time. 
This might be because buttons are viewed as personal adornment, 
and thus frivolous. Some people, in order to make absolutely sure 
that they are not behaving frivolously, will begin to wear even fewer 
buttons. This tendency seems to change standards of appropriate- 
ness—because one does not want to even give the appearance of 
frivolousness one will wear only 2 buttons, or only 1, or even no 
buttons at all. The desire here is to play it safe. But in doing this 
eventually what was once optional goodness becomes required. 

Perhaps this is so with moral appearances as well. Perhaps we are 
uncertain about the moral nature of these actions because some of 


° It is also possible that this type of misconstrual will have good effects. Per- 
haps, for example, these actions encourage people to be more charitable since 
once in a while they will discover that they misinterpreted the nature of an action 
they witness So, if the misconstrual occurs, and is then discovered, ıt May actu- 
ally do some good as well. 
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their features do seem to make them immoral. The uncertainty is 
caused by using an analogy with the independently immoral acts, 
and concluding that these actions are immoral, too. To make abso- 
lutely sure we are moral people, then, we might not perform mi- 
metically immoral actions. 

Yet I do not think that moral safety considerations can fully ex- 
plain the phenomenon. Consider again the fur-coat case: here, the 
agent knows that the coat was manufactured in an acceptable man- 
ner: no leg-hold traps are used in the manufacture of artificial fur. 
No fur trader received a benefit when the agent obtained the coat. 
There is no uncertainty in this case about whether wearing the coat 
can be described as independently immoral, because the reasons 
used to show that real fur is immoral are lacking in this case. The 
analogy does not exist at this level of reasons. Yet the act still seems 
tainted morally—and purely in virtue of its resemblance to the inde- 
pendently immoral act of purchasing a real fur coat. 

To make the matter a little more clear: in the imitation-fur-coat 
case, wearing it would not seem bad if one wore it only among other 
persons fully aquainted with the facts. And in this case it is certainly 
possible to be fully acquainted with the facts. If the agent were to 
wear the coat only in the presence of close friends who knew the 
origins of the coat (or, better yet, in private), moral discomfort 
would almost completely disappear. Thus, audience relativity is an 
important factor. An action may be immoral relative to one au- 
dience and not to another. 

An actor on the stage performs actions that resemble those which 
others perform off stage and in earnest. So, the actor playing 
Othello pretends to strangle, and his action resembles a genuine 
strangling. Yet the context is such that it is clear to everyone in the 
audience that the actor is not really strangling anybody. In an artifi- 
cial setting such as this there is no danger, because the audience is 
not liable to misconstrual. But, in real life, misconstrual of one’s 
actions will almost certainly take place. 

For these reasons, the explanation for why such mimetically im- 
moral actions are immoral is that these actions, because they are 
open to reasonable misconstrual, lead to bad consequences. When- 
ever we are in a situation where our actions can be so misconstrued, 
we must be careful, because what we do may set an example for 


others. 
UI. SETTING AN EXAMPLE 


In his Lectures on Ethics, Kant stresses the importance of good 
examples in moral life: they demonstrate the practical possibility of 
moral acts. “A bad example, however is a stumbling block, and gives 
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occasion for two evils, its imitation as a pattern and excuse” (op. 
cit., p. 111). Further, Kant writes: 


It is true that I can scarcely be blamed if another mistakes and misap- 
plies what I have done and it corrupts his morals, although I meant it 
otherwise and it had no immoral effect on me; but, though I am not 
liable for his misconstruction, I ought none the less to exercise re- 
straint to avoid giving occasion for any such scandalum (op. ctt., p. 
111).?° 


I take it that Kant is saying that, though one is not in any sense 
required to avoid performing what I call mimetically immoral acts, 
one ought not—in some weaker sense—perform them. That is, one 
is not responsible for the misconstrual—and the example Kant goes 
on to discuss makes it obvious that one is not responsible even if one 
can foresee that the misconstrual will take place. Still, one should 
not perform such actions, because one would be setting a bad exam- 
ple for others. 

Extrapolating from Kant’s discussion of animals, my guess is that 
Kant would hold that we ought not to perform mimetically immoral 
actions because to perform such actions would undermine our 
moral commitment to avoid doing independently immoral actions. 
The duty is an “indirect” one. If there is a danger of self-corruption 
one should not perform these actions. Also, as the above passage 
indicates, even when there is no danger of self-corruption one 
should refrain because the mistaken individual who witnesses the 
action stands in danger of corruption. 

Usually these indirect duties are described as pseudo duties one 
has to animals and inanimate objects in virtue of the duties one has 
toward rational beings. They are postulated to explain why we think 
cruelty to animals, and the wanton destruction of an object, are 
immoral acts—even though the objects are not rational. The real 
duties, or the direct duties, are those we have toward rational beings 
as ends in themselves. There is nothing intrinsically wrong with 
hurting animals, but one should refrain from hurting them because 
such behavior will lead to a lack of concern for rational beings. Here 
the analysis depicts the first role of resemblance—that is, the effect 
it has on our moral thinking in virtue of our tendency to assimilate 
behaviors that resemble one another. Of course, this would mean 
also that one has an indirect duty to perform actions that reinforce 
duties to rational beings in this way. Showing respect for dead bod- 
ies would be one example. 


10 “Scandalum” is an example that furnishes an excuse to act badly. 
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But the analysis can be extended to the second role—where mis- 
construal enters into the picture. Once again, one should refrain 
from performing the action because it will lead to a lack of concern 
for rational beings. Of course, with the fur-coat case we get an 
“indirect” indirect duty. But in the case of the feminist we can see 
why he would be reluctant to use the toilet in a strip-tease joint. He 
might be mistaken as a patron, thus lending legitimacy to an estab- 
lishment that unabashedly treats rational beings merely as means. 

But Kant’s analysis is not entirely satisfactory. First of all, it strikes 
me as arbitrary and misdirected. Indirect duties were postulated 
merely to explain our feelings or intuitions that happened to con- 
flict with Kant’s theory. Suppose, though, that we had a feeling that 
combing cat hair was terribly wrong. Would it then follow that we 
have an indirect duty to refrain from combing cat hair? Because, 
perhaps, it resembles pulling out people hair? I think, intuitively, 
that the reason we have misgivings about these actions is due to their 
consequences. And insofar as Kant’s analysis is plausible it recog- 
nizes this. Thus, the analysis allows for back-door consequentialism. 
We are worried about how the perceptions of these actions will 
affect others. 

Also, Kant’s analysis may have difficulty handling other cases. 
Sometimes it is even wrong to do something that looks virtuous. 
Suppose you are concerned, as we all are, that children develop a 
healthy fear of strangers. Offering an unrelated child a ride home 
from school is not a good thing to do, even if your motives are pure. 
That is because the children witnessing the kindness may lose their 
fear of strangers. 

I do not see how Kant’s view could really deal with this case effec- 
tively. Would he argue that I have an indirect duty to avoid giving 
rides to school children because my action resembles the action of 
the potential child abuser? But how is anybody’s sense of duty 
corrupted? My sense of duty is not corrupted, nor is the sense of 
duty any child witnessing my altruism. Rather, healthy instincts are 
not reinforced. The analysis here is purely consequentialist. 

One might believe, once again in a Kantian vein, that what is 
morally dubious or questionable about these actions is that in per- 
forming them the agent deceives others and so manipulates them. 
Yet these actions are not actually deceptive because the agent does 
not intend to generate false beliefs. Indeed, if she could perform the 
action without generating those beliefs, she would. Still, in perform- 
ing the action the agent does mislead others, and perhaps this is 
what explains the unease. 
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But performing misleading actions—and even deceptive ones—is 
not sufficient for moral blame. When Roger puts shoulder pads in 
his jacket he may mislead others about his build, but this is not 
immoral. The best answer is that what explains the unease is that 
people are worried about the consequences of performing mimeti- 
cally immoral actions. The consequences of being misconstrued are 
various: (1) a bad example is set, and the agent stands in danger of 
corrupting himself and others; (2) the agent could give offense to 
others; or, as in the case of Pompeia, (3) the action could make the 
lives of others miserable. The split in intuitions occurs because the 
bad consequences are not the same as the ones that make the inde- 
pendently immoral act immoral. 

That setting an example is a legitimate moral consideration is 
supported by the way we interact with children. When we interact 
with them we are especially careful in monitoring our conduct, be- 
cause we recognize the importance of setting a good example (or at 
least refraining from setting a bad example). Children can very easily 
misunderstand the actions of others, because they are unaware of 
the relevant facts in a given situation. One must be careful when 
telling jokes around children, since often they cannot see the joke. 
One must be careful in telling a judicious lie around children, since 
they cannot tell the difference between the judicious lie and the 
mean lie. Anyone who did not take this into consideration when in 
the presence of children would be morally lax. 

In most of our interactions with adults, relevant facts are clear, 
such as the assumptions and beliefs it would be reasonable for a 
person to make in a given clear context. Context usually is pretty 
clear for the purposes of moral evaluation.’ But not always. Thus, 
care needs to be taken because a bad example can and will have a 
bad effect on others. This consideration becomes stronger the more 
the agent is in the public eye; that is, the larger her audience is. A 
police officer, a judge, and a senator must avoid even the appear- 
ance of immorality. 

But more ordinary examples abound as well. People will often feel 
constrained to make restitution for things that merely look immoral. 
A person walking in a store looking at vases may pay the manage- 
ment if a vase, coincidentally, should fall and break near her. Even 
though she did not break it, it looks as though she did, and if she 
failed to pay for it she could be setting a bad example. If the cost of 
the vase is small, she will think it worth paying to offset any negative 
effects. 

Misconstrual can also lead to giving unnecessary offense to 
others. When I visit my grandmother, for example, I am careful not 
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to take along any book that might have ‘sex’ in the title. Even though 
any such book in my possession would be a purely scholarly work, 
my grandmother probably would not realize this, and become dis- 
tressed and offended. 

I should also point out that some of these misconstruals will be 
quite reasonable. Take the fur-coat case. In this case it would not be 
unreasonable for the observer to form the belief that the coat is real 
fur. She would simply be reasoning through a straightforward statis- 
tical syllogism. 99% of all coats like that are fur, therefore that coat 
is fur. 

But what if the misconstrual is unreasonable? What if it is due to a 
fallacy to which humans are particularly prone? For example, peo- 
ple are very prone to committing the gambler’s fallacy. Suppose I 
am a captain trying to regroup my men in the midst of a battle. I 
jump into a nearby crater to try to protect myself from shrapnel, 
and I urge my fellow soldiers to seek shelter as well. Well, another 
soldier might think me a greedy selfish pig for availing myself of the 
crater, when in fact I am not. That is because a lot of people think 
that the chance of a bomb falling on that crater is less than it falling 
on some other spot. After all, is not the probability of two shells 
falling on the same spot very low? So, my action looks selfish, and 
this might serve to demoralize my soldiers. But it only looks selfish 
because people are prone to commit the gambler’s fallacy. Lying in 
that crater to protect myself from shrapnel, I have the same chance 
as anybody else of getting hit by a bomb. 

People are also remarkable for their superstitious tendencies. 
Suppose, as I prepare to run in a marathon, the computer arranging 
the shirt numbers assigns 666 to me. If I wear that number I may 
look like a devil worshiper, when in fact I am not. And that shirt may 
strike fear into the superstitious hearts of many people. . 

Some might argue that the agent is only responsible for reason- 
able misconstruals that others make—not the unreasonable ones. A 
consequentialist will argue, however, that this distinction is basically 
irrelevant since, as long as the misunderstanding can be predicted, it 
must figure into the agent’s calculations. The agent is, of course, not 
responsible for the odd irregularities of others, since she has no way 
of predicting them. 

I must say at this point that I am not arguing that people must at 
all times display behavior conforming to accepted moral norms. I 
should hope that the account could allow for the moral rebel here 
and there. I am not saying categorically that it is wrong to wear very 
good fake furs, because long-term consequences must be taken into 
consideration. If wearing fur is immoral, but some people are 
deeply attached to fur, so attached that without a good substitute 
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they would keep on wearing the real thing, then fake fur could be a 
happy compromise. Maybe at first bad values are reinforced because 
of the resemblance. But as people come to accept the fake fur on a 
more widespread basis, this effect will disappear, because people will 
recognize the fake as a fake. Thus, overall, the consequences of 
wearing these coats would be good. This is fine, as long as one did 
expect the long-term consequences to work out. But this itself 
points to the force of resemblance—it still has to be taken into 
consideration by the moral agent. During the transition period, be- 
fore the effect of the resemblance is neutralized by familiarity, peo- 
ple should take steps to make sure that their actions are not misin- 
terpreted. Maybe, in this case, by wearing ‘“‘Great Fake” buttons on 
one’s coat. Another example: Jewish law forbids the consumption of 
dairy products with meat. Thus, when nondairy creamer was devel- 
oped, one would have thought that this was a boon to the dinner 
diet at Jewish tables. The nondairy creamer looked just like cream, 
however, so people were too embarassed to use it for a long time. 
Some were worried that others would quite naturally mistake it for 
the real thing. What some people did to get around this was to serve 
the nondairy creamer in its carton—even though this was tacky 


looking. 
IV CONCLUSION 

My account of mimetically immoral (and moral) actions is obviously 
consequentialist. But that is because it is clear that what makes mi- 
metically immoral acts immoral is their consequences. There is 
nothing intrinsically wrong, or immoral, with these actions. This 
analysis bolsters consequentialist ethics because that approach leads 
one to the correct analysis of these actions. The spirit of this ap- 
proach runs counter to deontology which famously maintains that 
immorality is not exhausted by consequences. Of course, deontolo- 
gists do not disregard consequences altogether—but they maintain 
that by and large other factors matter morally as well. Thus, a deon- 
tologist could agree with my analysis of mimetic immoralities by 
maintaining that consequences are for these actions what deter- 
mines their moral quality. Still, by being the fruitful theory here, 
consequentialism gains some ground. Rather than lead us to a plau- 
sible analysis, deontology merely responds to cases as they crop up. 

To return to Caesar’s wife, Pompeia: if she believed that her ac- 
tions could be misconstrued in such a way as to cause harm to her 
husband, or to corrupt others by her example, she had a moral 
reason not to do them. Being good is sometimes a matter of look- 
ing good. 

JULIA DRIVER 

Brooklyn College/CUNY 
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SEMANTIC SUPERVENIENCE AND REFERENTIAL 
INDETERMINACY* 


Y topic in this paper is the bearing of semantic superve- 

nience (the thesis that semantic facts supervene on nonse- 

mantic facts) on currently influential arguments for the 
indeterminacy of reference. I sball argue that semantic superven- 
ience is a two-edged sword: it is an essential weapon in some attacks 
on referential determinacy (e.g., that of Saul Kripke’s Wittgenstein), 
but it may also be used to undercut the main points against referen- 
tial determinacy which have been urged by Hilary Putnam. This is 
because semantic supervenience leads to semantic externalism, a 
doctrine I shall expound and defend by analogy with externalism in 
epistemology. 

I. ARGUMENTS FOR INDETERMINACY 


It will be instructive to begin by considering the general form of 
many arguments for indeterminacy. 


1. Supervenience: an A-statement, if true or false, is true or false in 
virtue of some B-fact. 

2 Underdetermination: the totality of B-facts does not suffice to de- 
termine a truth value for certain A-statements. 


Therefore, 
3. Indeterminacy: some A-statements lack determinate truth value 


The two premises may seem at first glance to be inconsistent with 
each other. Supervenience (or rather, its inverse) sometimes goes by 
the name ‘determination’; to say that one stratum of truth or fact 
supervenes on another is to say that the latter determines the 
former. So how can supervenience be combined with underdeter- 
mination? The answer is that it can be so long as one does not think 
of supervenience as automatically involving bivalence for the super- 
venient domain. Standard definitions of supervenience leave the 
question of bivalence open; they simply require that truths in the 
supervenient domain, to whatever extent there are truths in that 
domain, have determining grounds or sufficient conditions in the 


* I am indebted to Felicia Ackerman, Dorit Bar-On, Simon Blackbum, John 
Heil, Eli Hirsch, Jaegwon Kim, Michael Resnik, Ernest Sosa, Carl Posy, Peter van 
In , a colloquium audience at the University of North Carolma/Chapel Hill, 
aad the National Humanities Center. 
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subvenient domain.’ When we combine supervenience with under- 
determination, the result is a failure of bivalence, which is precisely 
what is involved in most varieties of indeterminacy.” 

Here are some familiar arguments that exemplify the pattern 
above: 

Indeterminacy of identity. Letting the A-statements be state- 
ments of personal identity and the B-facts be facts about physical 
and psychological continuity, we can see Derek Parfit? as arguing 
from 1 and 2 as premises to 3 as conclusion. In some of the puzzle 
cases he discusses, he maintains that the question ‘Is a certain per- 
son who emerges at the end of process P identical with the person 
who entered it?’ has no correct answer. Other parties to the debate 
can be seen as denying one of the premises. Roderick Chisholm and 
Kripke deny 1—they believe that identity is a “Further Fact”; Ber- 
nard Williams and Sydney Shoemaker deny 2—they espouse reduc- 
tionism without indeterminacy. 

Indsterminacy of the past. Now let the A-statements be state- 
ments in the past tense and the B-facts be facts about present memo- 
ries (or whatever else is admitted as currently available evidence 
bearing on the past). Michael Dummett* has flirted with a position 
that again argues from 1 and 2 to 3: there is no truth about the past 
without present evidence to warrant its assertion; there is no present 
evidence to warrant asserting that Caesar did, nor any to warrant 
asserting that he did not, eat eggs on the day he crossed the Rubi- 


' To forestall confusion, I could perhaps use a label such as ‘conditonal super- 
venience’ for premise 1. But that would fe a redundancy; supervenience as stan- 

understood is already conditional. See Kim, ‘‘Concepts of Su ience,” 
Philosophy and Phenomenological Research, XLV, 2 (December 1984). 153-76, 
esp. the definitions on pp. 168, 165. 

One might wonder haa the validity of the argument from supervenience 
and underdetermination to indeterminacy does not depend on a er premise 
ruling out “multiple base” supervenience, i.e., the supervenience of the A domain 
on two distinct domains, B and C. The idea would be that, even if A is underdeter- 
mined by B, ıt might be fully determined by C, and thus not indeterminate. But 
this idea turns out to be incompatible with my two premuses as already stated If A 
1s underdetermined by B, some A-statements are not rendered true or false by 
any B-facts; so if these statements are determunately true or false nonetheless, 
they cannot be true or false in virtue of B-facts; but that contradicts the premise 
that A supervehes on B. So my two premises are enough by themselves to yield 

i . There is no need to rule out muluple base supervemience, since 
the premises already imply that, if A supervenes on any further domain C, it must 
ibe detevaiined by Coco further than it le‘determined by $. (I thank Kim for 
getting mè to thmk about this point.) 

* Reasons and Persons (New York: Oxford, 1986), pt. 3. 

1 "The Reality of the Past,” in Truth and Other Enigmas (Cambridge: Har- 
vard, 1978), pp. 358-74. I simplify somewhat, but I think this argument captures 
the gist of one position considered by Dummett. 
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con; therefore, there is no truth either of his having eaten eggs that 
day or of his not having done so. 

Indeterminacy of position/momentum. Quantum mechanics does 
not allow us simultaneously to assign exact position and exact mo- 
mentum to a particle; one or the other of these two values must, in 
any given case, be underdetermined relative to the total QM state 
description. If we now add (what Einstein was never willing to admit) 
that QM is “complete,” in the sense that it tells us the whole story 
about what is going on at the microlevel, the result is indeterminacy: 
a particle that has determinate momentum will lack determinate 
position, and a particle that has determinate position will lack deter- 
minate momentum. Once more, we have argued our way to indeter- 
minacy from a combination of premises that amounts to superven- 
ience plus underdetermination. 

Indeterminacy of reference. To come at last to the topic of this 
paper, I believe that W.V. Quine, Kripke’s Wittgenstein, and others 
can be viewed as using an argument of the form I have displayed to 
establish the indeterminacy of reference. The A-statements in this 
case are statements of the form ‘term x refers to object y’. The 
B-facts, for Quine, are facts about verbal behavior and dispositions 
to verbal behavior; for Kripke’s Wittgenstein, they are naturalistic 
facts generally, including all the mental facts you like. The two phil- 
osophers I have mentioned may actually use a strengthened form of 
the argument: in no case do the B-facts determine a truth value for 
an A-statement, so there is never a fact of a term’s referring to one 
thing rather than another. 

Quine’s argument for the indeterminacy of translation may also 
be viewed as having the form above. Quine’s formulations of the 
indeterminacy of translation often misleadingly equate it with the 
underdetermination of correct translation by the totality of verbal 
behavior and behavioral dispositions.” But such underdetermination 
cannot be all there is to Quinean indeterminacy. If it were, Quine 
would have no warrant for saying, as he often does, that there is no 
fact of the matter regarding which of two divergent translation man- 
uals is correct; it would be open to others to maintain that there is 
indeed a fact of the matter, holding, perhaps, in virtue of mental 
states not manifested in behavior, or perhaps even holding simply as 
an ulterior “further fact.” To rule these positions out of court, 
Quine must insist that linguistic meaning supervenes on verbal be- 
havior. This is exactly what he does insist upon: “There is nothing in 
linguistic meaning, then, beyond what is to be gleaned from overt 


5 See, e.g., Word and Object (Cambridge: MIT, 1960), pp. 27, 73. 
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behavior in observable circumstances.” Quine also maintains that 
scientific truth in general is underdetermined by the totality of ob- 
servable facts, but in this case he does not espouse the correspond- 
ing supervenience thesis; that is why he is free to hold that there is 
no indeterminacy in our overall theory of nature.” An important 
moral here, generalizable to other problem areas in philosophy as 
well, is that underdetermination alone does not get you indetermi- 
nacy; for that you need underdetermination plus supervenience. 
II. PUTNAM’S MODEL-THEORETIC ARGUMENT AGAINST REALISM 
The indeterminacy of reference plays a crucial role in Putnam’s 
so-called model-theoretic argument against metaphysical realism. 
The tenet of metaphysical realism under attack in this argument is 
that truth is “‘radically nonepistemic,” by which Putnam means that 
even an “epistemically ideal” theory could be false. The essentials of 
the argument may for present purposes be set forth as follows: 


1. Suppose a theory T is epistemically ideal, i.e., meets all operational 
and theoretical constraints, but is false. 

2. Then there is an interpretation J under which T is true. 

3. The metaphysical realist is in no position to say that J is not an 
intended interpretation of T. 

4. So the metaphysical realist cannot say after all that 7, an ideal 
theory, is false.® 


Let me note three points briefly before getting on to the main 
topic. (1) I have omitted mention of the model-theoretic consider- 
ations that give the argument its name. Putnam invokes them in 
passing from 1 to 2, but they do not loom quite so large as his 
exposition would suggest; nor do they raise what I see as the central 


° Quine, “Indeterminacy of Translation Again,” this JOURNAL, LXXXIV, 1 (Jan- 
uary 1987): 5-10, at p. 5. 

7 These points of Quine interpretation are confirmed by “‘Indeterminacy of 
Translation in.” 

* Sources of the argument in Putnam are “Realism and Reason,” Proceedings 
of the American Philosophical Association, L (1977): 483-98, repr. in his 
Meaning and the Moral Sciences (New York: Routledge & Kegan Paul, 1978), 

p. 1238-40, hereafter MMS, Reason, Truth, and History (New York: Cam- 
ridge, 1981), ch. 2, hereafter RTH; and “Models and Reality,” in Putnam, Real- 
ism and Reason (New York: reese 1983), pp. 1-25, hereafter RR. Inter- 
pretations of the argument similar to a ER E 
‘How to Be an Anti-Reahst,” Proceedings and Addresses of the American Phi 
sophical Association, LVI (1982). 47-70; Anthony L. Brueckner, “Putnam's 
ModelTheoretic t against Metaphyaical Reahsm,” Analysis, XLIV, 3 
une 1984): 1 ; and Mark Heller, “Putnam, Reference, and Realism,” in 
tdwest Studies ın Philosophy, xu, Peter A. French, Theodore E Ue Jr. 
and Howard K. Wettstein, eds. (Minneapolis: Minnesota UP, 1988), pp. 113-27. 
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issues in the argument.’ (2) The conclusion as formulated above is 
the weakest conclusion supported by the premises which is incom- 
patible with metaphysical realism as Putnam defines it.’° (3) To what 
extent is this argument an argument against realism? The truth of 
sentences is a function of two things, their interpretation and how 
the world is. What the foregoing argument shows, if successful, is 
that interpretation is indeterminate (at least for the metaphysical 
realist) and that sentential truth is therefore indeterminate (at least 
for the metaphysical realist). Even if Putnam is right about this, it 
would not follow in the least that how the world is is indeterminate. 
A core element of realism therefore remains untouched by Put- 
nam’s argument.’? 

I turn now to the main issue. The heart of Putnam’s argument is 
clearly premise 3, the premise in which referential indeterminacy 
comes to the fore. Why is anyone compelled to accept this premise? 
What prevents the metaphysical realist from saying that / (the con- 
cocted interpretation under which T is true) is an incorrect inter- 
pretation? In support of the premise, Putnam claims that J satisfies 
all operational and theoretical constraints. But many philosophers 
will invoke constraints on reference that go beyond those admitted 
by Putnam. In particular, many will invoke causal constraints—a 
term t as used by S is correctly interpretable as referring to Ks only 
if there are appropriate causal connections between Ks and S’s use 
of the term. More generally, some will contend that reference is to 
be analyzed somehow in terms of a naturalistic relation R, which 
may include causal relations and other ingredients as well: 


x refers to y if and only if x bears R to J. 


This is the schema for a proposal of Hartry Field’s as Putnam intro- 
duces it (RTH, 45). 

Putnam is well aware of this possibility and says the following 
things by way of objection to it: 


° As David Lewis points out, without using the Loewenheim-Skolem theorem, 
one can already show that any large enough world can satsfy any consistent 
theory What the Loewenheim-Skolem theorem adds 1s simply that any denumer- 
able world is large Sponge. See Lewis, “Putnam's Paradox,” Australasian Jour- 
nal of Philosophy, , 3 (September 1984). 221-36, at p. 229. 

10 in a longer version of this paper, I take up the question whether the argu- 
ment can be made to yield the stronger conclusion that an ideal theory must be 
true. 

1 van Inwagen makes a similar pomt about Putnam’s argument concerning 
brains in vats, see his “On Always Being Wrong,” Midwest Studies ın Philo- 
sophy, xu, pp. 95-111. According to van Inwagen, Putnam's arguments have 
everything to do with what it takes for a person to be intentionally related to a 
propoation and nothing to do with what it takes for a proposition to be true. 
Compare also Lewis, p. 232 ‘The metaphysics of realism survives unscathed.” 
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A causal theory of reference would not help, “for how ‘causes’ can 
uniquely refer is as much of a puzzle as how ‘cat’ can, on the metaphysi- 
cal realist picture” (MMS, 126); “the reference of ‘x bears R to yis 
itself indeterminate” (RTH, 45); “it ıs a puzzle how we could learn to 
express what Field wants us to say” (RTH, 46) 


This is the “just more theory” objection. If the interpretation of T 
was indeterminate before, it will remain indeterminate if we add 
Field’s account of reference to T, for the account itself suffers from 
the same indeterminacy. The theory-cum-account has interpreta- 
tions unintended by Field that make it true, but there is no way to 
Tule these interpretations out of court—there is nothing to make it 
the case that they are “really” unintended. 

Several of Putnam’s critics have thought that the foregoing objec- 
tion to the causal theory is completely wrongheaded. They have 
pointed out that in putting forth the causal theory, one does not 
“just add more theory”; one poses an external constraint on refer- 
ence, a constraint that works from outside the theory rather than 
within it. Thus David Lewis writes: 

The constraint is mot that an intended interpretation must somehow 

make our account of C [the constraint] come true. The constraint is 

that an intended interpretation must conform to C itself (op. cit., p 

225). 


It is not causal language that fixes reference; it is causation itself. 

To the exasperation of his critics, Putnam has refused to see this 
point. He reports it accurately enough, but then more or less simply 
repeats what he has said before.'? I hold no brief for the causal 
theory, but I believe the critics are essentially right. Moreover, it is 
important to see that they are right, since the ploy used by Putnam 
against the causal theory could be used against any external con- 
straint on reference, thereby dooming us to accept Putnam’s own 
conclusions—conclusions that many find incredible. In the hope of 
getting more people to see the externalists’ point, perhaps including 
Putnam himself, I propose to pursue some analogies between episte- 
mology and semantics. 

II EPISTEMOLOGY AND SEMANTICS 

One of the major tasks of epistemology is to articulate epistemic 
principles—principles of the form ‘If ____, then subject S is justi- 
fied in believing proposition p’. Such principles divide into two 
classes. One class includes principles that warrant inference from 
already justified propositions to further propositions; the anteced- 


12 See RR, introduction, esp. xi, and 295-6. 
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ents of such principles will specify that certain propositions already 
have some epistemic status for the subject. But not all epistemic 
principles can be like this. There must also be a class of epistemic 
principles that specify the nonepistemic conditions under which 
some beliefs come to have some epistemic status or other in the first 
place—the conditions, one might say, under which epistemic status 
is generated. Examples of principles belonging to this second class, f 
which I shall call generation principles, are Descartes’s principle 
that whatever is perceived clearly and distinctly is certain, Chis- 
holm’s principle that whatever is self-presenting is evident, Alvin 
Goldman’s principle that a belief formed as the result of a reliable 
process is justified, and the coherentist’s principle that a belief be- 
longing to a suitably coherent set of beliefs is justified. As the last 
two examples should make clear, there is nothing inherently foun- 
dationalist in the requirement that a theory of knowledge include at 
least one generation principle. The requirement is really just the 
requirement of epistemic supervenience—that there be some non- 
epistemic features that ultimately underlie the instantiation of any 
epistemic property.’® : 
A corollary of epistemic supervenience is epistemological exter- 
nalism: the view that there are factors that contribute to knowledge 
and make it possible regardless of whether they themselves are 
known. Causal and reliability theories of knowledge are perbaps the 
best known externalist theories in epistemology, but it is worth 
pointing out that even a theory like Descartes’s is, in an important 
sense, externalist. For Descartes’s theory specifies a condition that 
gives rise to knowledge—namely, clear and distinct perception—re- 
gardless of whether it itself is known to obtain. One way to see the 
externalist implications of epistemic supervenience is as follows: a 
generation principle says that the obtaining of whatever çondition is 
specified in its antecedent is sufficient for the accruing of whatever 
epistemic status is specified in its consequent (knowledge, justifica- 
tion, or what have you). That means that nothing else is necessary; in 


13 For further discussion of generation ciples, see my “Foundationalism, 
Epistemic Principles, and the Cartesian Ene The Philosophical Review, 
LXXXVII, 1 (January 1979): 55-91 For more on epistemic supervenience, see 
Sosa, “The Foundations of Foundationalism,”” Nows, XIV, 4 (November 1980): 
547-65, as well as various of the earber and later essays in Knowledge in hale Seed 
tive (New York: Cambridge, 1991); Alvin Goldman, ‘‘What ts Epistemic J 

tion?,” m Justification irs Knowledge, George S. Pappas, ed. (Boston: Reidel, 
1979), pp 1-23; and my “E Supervenience and the Circle of Belief,” The 
Monzst, LXVII, ‘| (January 985): 90-104. 
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particular, knowledge that the condition obtains is not necessary." 
Thus, a Cartesian knower does not have to know that his state is one 
of clear and distinct perception; Goldman’s knowers do not have to 
know about the reliability of their own cognitive processes, and 
so on. 

The reader is likely by now to have guessed where I am heading. 
Semantic principles in the theory of meaning or reference are in 
important ways ‘analogous to epistemic principles in epistemology. 
Field’s schema is precisely the schema for a generation principle in 
semantics: its right-hand side should be seen as spelling out in non- 
semantic terms the conditions under which certain semantic rela- 
tions first arise.” If semantic phenomena, like epistemic phenom- 
ena, are supervenient on natural phenomena, then there must be at 
least one true principle satisfying Field’s schema—a causal princi- 
ple, as it might be, but if not that, some other principle specifying 
the natural basis for reference. 

Let us suppose, then, that the following is a true generation prin- 
ciple in semantics (I now ignore the other half of Field’s bicondi- 
tional): if x bears R to y, then x refers to y. In light of our discussion 
above, it should be clear that, if this principle is true, all that has to 
happen in order for x to refer to y is that x bear R to y. Nothing 
more is necessary; in particular, it is not necessary that we be able to 
say “what Field wants us to say.” Contrary to what Putnam repeat- 
edly insists upon,’® we do not have to be able to “single out” one 
relation as the intended referent of ‘R’ before the causal theory (or 
the R-theory) can do its job. We can do the singling out later, as I 
shall show in the next paragraph, but we need not do it as a precon- 
dition of the successful working of the theory. One might put the 
point this way: as knowledge makers need not be known, so refer- 
ence makers need not be referred to.!” 


M More accurately: nothing 1s necessary for the consequent beyond what is 
necessary for the ae eran itself. It is quite plausibly arguable in the case of 
many generation principles that their antecedents may obtain without the sub- 
ject’s having to know that they do. 

18 That Field himself sees the schema m this light 1s clear from Field, ‘‘Conven- 
tionalism, and Instrumentalism in Semantics,” Notis, Ix, 4 (November 1975): 
875—405, at p 382. 

16 See, e g., “Why There Isn't a Ready-Made World,” in RR, 205-28; and RTH, 
46 

U For further discussion of the bearing of externalist theories against Putnam's 
views, see Heil, “The Epistemic Route to Anti- Realism,” Australasian Journal of 
Philosophy, LXVI, 2 (June 1988): 161-73; and Heller, op. cit. Heil and Heller 
emphasize epistemological externalism rather than semantic externalism. 
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The analogy with epistemology can be carried one step further. 
Knowledge makers need not be known, I just said, but they can be. 
This will happen whenever knowledge-making states of affairs come 
themselves to instantiate the antecedents of epistemic principles. If 
we should acquire clear and distinct perception about the occur- 
rence of clear and distinct perception, or reliably formed belief 
about the reliability of the way in which a given belief was formed, 
we would come thereby to have knowledge of the occurrence of a 
knowledge maker. I have argued elsewhere that there is nothing 
objectionably circular about this.’* Similarly (as several writers have 
pointed out), if our use of the term ‘R’ were related by R itself to 
instances of R, that would make it the case that ‘R’ refers to R.'® 
There is nothing objectionably circular about this, either; as Michael 
Devitt points out (ibid.), there is no reason why a comprehensive 
theory of reference cannot (and should not) apply to its own words. 

Putnam is aware of the possibility of holding that ‘R’ refers to R 
in virtue of self-subsumption (as I shall call the circumstance in 
which ‘R’ stands in R to instances of R). Leaving aside an objection 
based on the theory of types, he objects to self-subsumption chiefly 
on the ground that many putative reference relations are similarly 
self-subsuming. We can cook up a crazy scheme of reference under 
which ‘tables’ refers to alligators. To go with it, we can concoct a 
gerrymandered relation causation* such that our use of the word 
‘tables’, besides being causally related to tables, is also causally* re- 
lated to alligators. (See RR, introduction and 295-6). Moreover, 
things may even be so rigged that our use of ‘cause*’ (if we do use 
it!) is related by causation* to instances of itself. So on what ground 
could we say that causation and not causation* is what makes for 
reference, and hence that ‘tables’ refers to tables rather than to 
alligators? Evidently, self-subsumption is not enough. 

At this point, it will again be instructive to consider a parallel 
situation in epistemology. I have argued elsewhere that there need 
not be any vicious circularity in arriving at knowledge of epistemic 
principles by a route that uses those very principles. For example, 
the layman’s reason for believing in induction—“‘it works””—can be 
established inductively and serves to justify induction itself." A natu- 


18 “Foundationalism, Epustemic Prinaples, and the Cartesian Circle.” 

19 See, e.g., Brueckner, p. 138, and Michael Devitt, Realism and Truth (Prince- 
ton: University Press, 1984), pp. 189-90. 

% See my ‘Reliability, Jus ion, and the Problem of Induction,” Midwest 
Studies in Philosophy, (x, Peter A. French, Theodore E. Uehling, Jr., and How- 
ard K. Wettstein, eds (Minneapolis: Minnesota UP, 1984), pp. 5 5—67. 
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ral objection to this proposal is that it could be used with equal right 
by counterinductivists to justify counterinduction. Counterinductiv- 
ists believe that, if x% of the As have been Bs in the past, one should 
believe that (100 — x)% of the As will be Bs in the future. Counter- 
induction has had a miserable record of success in the past, but a 
counterinductivist would simply regard this as evidence that it will 
work splendidly in the future. He can cite his own principle in self- 
defense just as surely as the inductivist, but it is absurd (so the 
objection goes) to think that this amounts to any sort of justification 
of it. I trust that the parallel with Putnam’s point about causation* 
will be clear. 

I believe there is a good reply to this objection, whether taken in 
its epistemological or its semantical version. Supposing our world to 
be one in which jnduction is reliable (and counterinduction not), 
and supposing that reliability rather than its opposite is what confers 
justification, induction will be justified and counterinduction will 
not be, even though both are self-subsuming. It is not mere self- 
subsumability that confers justification, but self-subsumability under 
a true principle. (Which is not to say a principle known to be true; 
the principle can state an external condition of justification.) If it is 
true that reliable beliefs are justified beliefs and that induction is 
reliable, then counterinductivists will be out of luck. Similarly, if it is 
a fact that causation has something important to da with reference 
but causation* does not, then causal* theorists will be out of luck. 
Their theory will be mistaken despite being self-subsuming. 

In the last paragraph, I have drawn out an important conse- 
quence of the putative fact that causation but not causation* makes 
for reference. Putnam seems at times to doubt that there could be 
any such fact. In an article replying to Field, he says that to hold that 
reference is fixed by causation rather than by causation* “because 
that’s the nature of reference” is unabashed essentialism.?! And 
elsewhere in response to Field, he claims that the fact, if it is a fact, 
that R underlies reference must be taken as “a kind of primitive, 
surd, metaphysical truth” (RTH, 46). 

I believe that Putnam’s claim against Field should be compared 
with the following: the fact that it is reliability (and not, say, a suc- 
cess rate of twenty, ten, or zero per cent) that makes for justification 
must be taken as a primitive, surd, metaphysical truth. Primitive and 
metaphysical it may be; whether it is also surd I leave to the reader’s 
own consideration.” 


” Putnam, “Reply to Two Realists,” this JOURNAL, LXXIX, 10 (October 1982): 
575-7. 
* A similar point is made by Brueckner, p. 138. 
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IV ANOTHER ARGUMENT FOR REFERENTIAL INDETERMINACY 
There is one more possible point in Putnam’s defense I would like to 
consider. Ernest LePore and Barry Loewer™ defend Putnam against 
some of Devitt’s criticisms by citing a Putnam doctrine that I have so 
far said nothing about: 


Devitt has missed the force of Putnam’s argument that there are alter- 
native reference relations that result in identical assignments of truth 
values at every possible world. If there is a causal characterization of 
reference, there also are alternative characterizations (in terms of 
cause*, cause™, and so on) that agree on the truth values of statements 
at all possible worlds. The queston Putnam asks the MR is why say that 
the account of reference in terms of cause rather than m terms of 
cause is the correct one (ibid., p. 464)? 


The Putnam-endorsed point to which LePore and Loewer are call- 
ing attention is this: the correct scheme of reference for a language 
is underdetermined by the distribution of truth values it yields for 
all sentences of the language in all possible worlds (since competing 
schemes can yield identical distributions). Having noted this, they go 
on to ask the supposedly rhetorical question, what could make one 
rather than another of the competing schemes correct? If their own 
answer (or Putnam’s) is ‘nothing’, note what they (or he) must be 
presupposing: that the reference of terms supervenes on the truth 
values of sentences containing them. (Recall a lesson from section I: 
underdetermination by itself is insufficient to beget indeterminacy; 
to get from underdetermination to indeterminacy, you must assume 
a supervenience thesis as well.) The needed supervenience thesis 
could perhaps be formulated as follows: two schemes of reference 
for a language L do not differ in correctness unless they also differ 
in the truth values they yield for at least one sentence of L in at least 
one possible world.™ If this thesis is granted, the indeterminacy of 
reference swiftly follows. 

But why on earth should anyone accept the thesis? I would like to 
make three points about it. (i) It is obviously not a thesis that causal 
theorists are going to accept. If they do not accept it, they can 


2 “A Putnam's Progress,” in Midwest Studies in Philosophy, xu, pp 459-73. 

M This is very similar to the principle John Wallace expounds and attributes to 
F in “Only in the Context of a tence Do Words Have Any Meaning,” in 
Midwest Studies 1n Philosophy, 0, Peter A French, Theodore E. Uehling, Jr., 
and Howard K Wettstein, eds. (Minneapolis: Minnesota UP, 1977), pp 144-64. 
The principle I formulate would be equivalent to the Wallace-Frege principle if 
the “truth conditions” in Wallace’s formulation were given by assignments of 
truth values in all possible worlds. 
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cheerfully admit Putnam’s underdetermination point. (ii) To claim 
(as the thesis does) that reference is exhausted by its contributions 
to truth conditions is surely to indulge in what Putnam once dispar- 
aged as ‘‘negative essentialism’’: it is to claim that certain constraints 
exhaust the content of a given concept even though they do not 
suffice to determine its extension.” Why is negative essentialism 
about reference supposed to be less objectionable than the ‘“‘un- 
abashed essentialism” Putnam complains of when causal theorists 
say it is the nature of reference to be founded on causation? (iii) 
Finally, how plausible is it anyway to suppose that reference super- 
venes on nothing but the distribution of truth values (in all possible 
worlds) over the sentences of a language? To my mind, that supposi- 
tion is every bit as arbitrary as the following one: reference super- 
venes on nothing but the ratio of true sentences to false ones. If two 
schemes of reference for a language L both make half of its sen- 
tences true and half false (and there are few that do not do this!), 


then they are equally correct. 
V ALTERNATIVES TO SEMANTIC SUPERVENIENCE 


We have seen so far that, if the thesis of semantic supervenience is 
correct, Putnam’s objections to causal theories of reference are in- 
deed ill-conceived, just as his critics contend. Well, then, why could 
he not simply deny such supervenience? In this section, I would like 
to explore the options available to anyone who does deny it. I be- 
lieve that once we are clear about what the alternatives are, the 
doctrine of semantic supervenience will appear in a better light. 

To affirm semantic supervenience is to affirm the following: neces- 
sarily, if any semantic fact ever obtains (e.g., that x refers to y, or 
that sentence S1 means the same as SQ), it does so in virtue of some 
nonsemantic fact. To deny semantic supervenience would therefore 
be to embrace one of the following alternatives as possible: either (i) 
some semantic facts are not made true by any further facts, or (ii) 
they hold in virtue of further facts, but always facts that are them- 
selves semantic. Let us consider each alternative. 

Alternative (i) is accepted by some philosophers, but would cer- 
tainly not be acceptable to Putnam. Brute semantic facts, or seman- 
tic facts that do not hold in virtue of any deeper stratum of fact, are 
precisely his target when he inveighs against “magical” theories of 
reference.” 
ey Mind, Language and Reality (New York: Cambridge, 1975), pp. 


7 This us clear from the dialectical contexts in which Putnam levels the charge 
of magic in the first two chapters of RTH. The terms he uses to explain what he 
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Alternative (ii) calls to mind C. S. Peirce’s strange doctrine of 
signs and interpretants. According to Peirce, a sign signifies an ob- 
ject only through the mediation of a third thing, an interpretant; 
and every interpretant is itself a sign, signifying an object. These 
premises obviously generate an infinite regress (or perhaps a circle, 
if it is allowed that an interpretant may have for its own interpretant 
a sign that has already occurred earlier in the series). There is an 
echo of Peirce’s view in Putnam’s remark that “any concept we may 
have of the relation between a sign and its object is another sign” 
(MMS, 126~—7).%” Nonetheless, I cannot really believe that Putnam 
would wish to accept an infinite regress of interpretants. Peirce’s 
regress is surely a reductio ad absurdum of the premises that gener- 
ate it. 

Here, then, is why Putnam should accept semantic supervenience: 
if he denies it, he must believe either in the very magic he deprecates 
or in something worse, namely, that reference is begotten only by 
reference, either ad infinitum or in a circle.* 

The alternatives before us can be expanded and illustrated fur- 
ther by more analogies from epistemology. Consider the familiar 
problem of the infinite justificatory regress, which arises if we ac- 
cept the following four assumptions: 


1. Some beliefs are justified. 

2 Ifa belief is justified, there is some further fact in virtue of which it 
18 justified. 

8. That in virtue of which a belief is justified must itself either be or 
involve a justified belief. 

4. There cannot be circles in justification. (I shall not attempt to for- 
mulate this assumption precisely, but the need for it is clear.) 


These assumptions are readily seen to generate an infimite regress 
of justification. Confronted with such a regress, we have five op- 
tions. (i) We could accept all four assumptions and the implied re- 
gress. Peirce is sometimes said to have embraced the possibility of 





means by magic—it is the view that the connection between ei) (or 
between representations and what they represent) 1s intrinsic tH 5, 47, 51) or 
necessary (RTH, 3, 17-18, 51)—are not aptly chosen. A primitive relation (hus 
real target) need not be intrinsic, nor vice versa, nor need either of these be 
necessary 
Y Pumam connects the point with Occam rather than Peirce, the Peircean 
overtones are somewhat muffled by his use of the word ‘ ne af 
Strictly speaking, he need only believe in the posstbtlity of these things, but 
Wicca a wo e era peel 


SEMANTIC SUPERVENIENCE 357 


such a regress,” but few other philosophers have taken it seriously. 
(ii) We could deny assumption 1, landing ourselves in total, unmiti- 
gated skepticism. (iii) We could deny assumption 2, positing self- 
justifying or intrinsically justified beliefs. (iv) We could deny as- 
sumption 3. As I shall explain more fully in a moment, that would 
mean embracing epistemological externalism, at least of the minimal 
variety implicated in epistemic supervenience. (v) Finally, we could 
deny assumption 4. That is the option sometimes associated with 
coherentist epistemologies. 

In my view, we must in any case deny assumption 3, thus embrac- 
ing epistemic supervenience. In defense of this stand, I would point 
out that positions that apparently deny one of the other assumptions 
are better construed as denying 3 instead (or perhaps in addition). 
For example, the most plausible coherentism is not the linear coher- 
entism that says belief A is justified because belief B is justified, 
which is justified in its turn because belief C is justified, and so on, 
until we get back around to A; it is the global coherentism that says 
each belief is justified in virtue of its place in the entire system.” 
(Light dawns gradually, or perhaps in a flash, over the whole.) On 
this picture, justification supervenes upon coherence relations that 
are thoroughly nonepistemic, neither involving knowledge nor 
themselves needing to be known. Again, the most plausible founda- 
tionalism does not posit beliefs that are literally self-justifying; it 
posits beliefs that are justified by further facts—an option that does 
not compromise foundationalism so long as the facts in question are 
nondoxastic and nonepistemic.*! It thus denies assumption 3. 

Let us now turn to semantics, which I believe may be viewed in 
quite parallel fashion. As before, there are four assumptions that 
generate an infinite regress and five resulting options. 


1. There is such a phenomenon as reference—there are truths of the 
form ‘term x refers to object y’ (for subject S or in language L, of 
course). 


* See “Questions Concerning Certain Faculties Claimed for Man,” in Wri- 
ings of Charles S. Peirce, Vol. 2, Edward C. Moore, ed (Bloomington Indiana 
UP, 1984), pp. 193-211. What he actually defends is the posmbility that each 
cognition of an object be “determined” by an earlier cognition; it is not so clear 
that he would endorse an infinite regress of justification. 

X I take the terms ‘hnear’ and ‘global’ from Laurence BonJour, The Structure 
of Empirical Knowledge (Cambridge: Harvard, 1985), pp. 89-93, though not 
quite in his senses. My global coherentist accepts epistemic supervenience; Bon- 
Jour’s evidently does not For more on the most plausible coherentism as involv- 
ing supervenience, see Sosa, “The Raft and the Pyramid,” in Knowledge ın Per- 
spective, pp. 165-91, esp. sects. 7 and 8. 

*! I say more about this option in ‘‘Epistemic Supervenience and the Circle of 
Belef.” 
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2. If one thing refers to another, there is some further fact in virtue of 
which it so refers. 

3. That in virtue of which a sign refers must itself be a fact that either 
involves reference or is itself referred to. 

4. There cannot be circles in reference (it cannot be that x refers as ıt 
does only because something else refers as it does and conversely). 


What are the options this time? 

(i) We could accept all four assumptions and the resulting infinite 
regress. This is the view to which Peirce was apparently committed, 
but he himself said the regress was absurd,” as I trust most readers 
will agree. I shall mention an alternative interpretation of Peirce 
presently. f 

Gi) We could deny assumption 1 and be thoroughgoing nihilists 
about reference. I think this is implicitly the position of Putnam and 
Quine, even though they prefer to portray their views less starkly, 
Quine allowing for “relative reference” and Putnam for “internal 
reference.”” 

Gii) We could deny assumption 2 and hold that the reference of 
some items to others is an ultimate, irreducible fact. If the referring 
items in question are words, we would have the position of Plato’s 
Cratylus. If, more plausibly, it is held that words refer only through 
the mediation of thoughts, but that the reference of thoughts is 
primitive and irreducible, we would have the position of Chisholm 
and Franz Brentano. This, of course, is the position Putnam casti- 
gates as magic.” 

Gv) We could deny assumption 3 and embrace semantic superve- 
nience. This, as we have seen, is where the causal theory of reference 
fits into the picture. 

(v) Finally, we could deny assumption 4 and embrace some sort of 
coherence theory of reference. As in the epistemological case, how- 
ever, the most plausible coherence theories will deny not 4, but 3. 
To deny 4—to hold that each of two terms has the referential prop- 
erties it does only because of the referential properties possessed by 


2 My source on Peirce is A. J. Ayer, The Origins of Pragmatism (San Fran- 
cisco: Ereeman & Cooper, 1968), p. 123. 

2 I say more about the surrogates for reference offered by Putnam and Quine 
in ‘The Myth of Relative Reference,” unpublished 

H On p. 110 of Representation and Reality (Cambridge: MIT, 1988), Putnam 
says that Chisholm and Brentano are right to regard intentional properties as 

itive. So does Putnam now believe in magic? More likely, he only agrees with 

Chisholm and Brentano on the negative point that intentional properties are 
irreducible—not also on the positive point that they are features of the universe 
in good objective standing. 


SEMANTIC SUPERVENIENCE 359 


the other—really would be absurd. A more plausible coherence 
theory of reference would deny 3, holding that reference accrues to 
a system of terms when they are related to each other in ways de- 
scribable without using semantic vocabulary. This may have been the 
view that Peirce actually held; it is at any rate the sort of view that 
A. J. Ayer attributes to him.™ It is also arguable, though I shall not 
argue it here, that Kant’s view about the reference of mental repre- 
sentations to objects fits here.” 

Let us now take stock. We have canvassed five options about refer- 
ence, broadly construed to cover both linguistic reference and in- 
tentionality: circle, regress, magic, supervenience, and elimination. 
The circle and the regress can be lumped together as both involving 
the idea that reference is born only of reference—not a winning 
proposition. My own preference is for semantic supervenience, but 
if that proved an illusory hope, I would sooner believe in magic than 
embrace any of the other alternatives. What about Putnam? Magic 
he rejects, as we well know. I believe he would also reject the circle 
and the regress; see, for example, RTH, 43, where he objects to 
explaining reference in terms of intention on grounds of circularity. 
He does not take an explicit stand on semantic supervenience, but 
his dismissal of external constraints on reference depends, as I have 
argued, on denying or ignoring the implications of that doctrine. 
That leaves us with nihilism, or the elimination of reference, which I 
think is the ultimate upshot of Putnam’s views.®” 

VI. CONCLUDING COMPARISON: KRIPKE'S WITTGENSTEIN 
By way of conclusion, I would like to compare Putnam’s attack on 
reference with the argument against reference set forth by Kripke in 


™ Ayer, p. 124 “[N]o individual sgn refers beyond itself. .. [I]ts having a 
reference to such and such an object simply consists in the fact that it figures or at 
least is capable of figuring in a series of signs of such and such a sort [T]he 
series of actual signs must be finite. .. It is their membership of the senes that 
gives the individual signs their meaning, rather than any special meaningful rela- 
tion m which they stand to one another.” 

* It is further arguable that the price we pay 1s a reductive account of the 
objects of reference. Here Kant comes out looking better than Ayer’s Peirce A 

omenalist theory of objects is perhaps not out of the question, but a verbalist 
theory of objects certainly would be! 

* Putnam does not present himself as a nihilist, of course; his ‘internal real- 
ism’ Ee Eee Ey wit He sometimes gives the impression that the 
rescue is to ffect the referents of our terms be not the mind- and 
theory-independent oen of the metaphysical realist, but objects of a closer-to- 
home variety that exist somehow ‘“‘within’”’ bur schen Gee RTH, 52). I think it is 
plain, however, that the difficulties Putnam raises for reference have to do with 
the relation itself and not with its relata Lf the difficulties are at all genuine, they 
make reference indeterminate or nonexistent for any theory, internal realism not 
excepted I say more about this in the paper cited in fn. 33. 


360 THE JOURNAL OF PHILOSOPHY 


his interpretation of Wittgenstein on rule following.” Although the 
argumentative strategies of Putnam and Kripke are superficially sim- 
ilar, I believe there is an important difference—a difference that 
makes Kripke’s argument by far the more formidable. (I shall refer 
to the argument as Kripke’s, even though he says he is only 
expounding ‘‘Wittgenstein’s argument as it struck Kripke” 
(ibid., p. 5).) 

Kripke’s conclusion, as I take Putnam’s to be, is nihilistic: there is 
never a fact of my meaning one thing rather than another by any 
word—e.g., no fact of my using ‘plus’ to refer to addition rather 
than guaddition, an operation that normally coincides with addi- 
tion, but yields discrepant values at certain points. Here is how 
Kripke puts it: 

There can be no fact as to what I mean by ‘plus’, or any other word at 

any time. . . . [T]here is no fact about me that distinguishes between my 

meaning a definite function by ‘+’ .. . and my meaning nothing at all 

Ghid., p. 21). 


In some passages, it appears that Kripke’s core argument can be 
developed along the following lines. (See tbid., pp. 15-22.) How can 
a teacher convey unambiguously to a learner that ‘plus’ refers to 
addition rather than some other operation? Anything the teacher 
says can be misinterpreted; and anything the teacher shows by ; 
means of examples will be compatible with many different extrapola- į 
tions. Now let teacher and learner be contracted into the same indi- ` 
vidual: imagine that I am giving myself instructions to fix what I , 
shall mean by ‘plus’. If any instructions I give to a pupil leave the $, 
meaning of ‘plus’ unsettled, how can I do better with the instruc- 
tions I give myself? How can any instructions regarding the use of. 
‘plus’ pick out one function rather than another that diverges from 
it at a certain point? Giving myself a finite number of examples that 
instantiate the intended function clearly will not suffice; and if I try 
to internalize a rule that determines the output of the function for 
every pair of numbers in its infinite domain, I only invite the ques- 
tion how the rule itself (or the symbols for it in my language of 
thought) are to be interpreted. For example, if the rule is stated in 
terms of counting, I invite the question how it is that by ‘counting’ I 
did not mean quounting, an operation that relates to quaddition as 
counting relates to addition. 

It is at this point that it may begin to appear that Kripke’s argu- 
ment is similar to Putnam’s—and similarly in error. Kripke coins 


~~ 


3 Wittgenstein on Rules and Private Language (Cambridge: Harvard, 1982) 
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‘qu’-words as freely as Putnam attaches asterisks; he repeatedly asks 
how it is that a word has one meaning rather than that of its quoun- 
terpart. It may seem that Putnam and Kripke are both querying the 
interpretation of items that should not be up for interpretation—of 
facts that are simply facts, not items that need stand at either end of 
a semantic relation. 

In fact, however, I believe that Kripke’s argument is apposite in a 
way that Putnam’s is not. Kripke does not irrelevantly ask, as Put- 
nam does, how we can single out the facts put forth as those on 
which semantic facts supervene. He does not query the interpreta- 
tion of the sentences we use in an effort to state the facts belonging 
to the supervenience base. Instead, he challenges the adequacy of 
the facts as stated to determine semantic phenomena. When he asks 
Putnamesque questions about quounting and the like, he asks them 
about thought symbols, which are the very things (along with words) 
whose referring power is in question. 

Kripke’s overall argument has the form I set out in section I: it 
combines a premise of supervenience with a premise of underdeter- 
mination. His defense of the underdetermination premise proceeds 
by criticizing the sufficiency of various candidate facts in which 
meaning facts might be supposed to consist—e.g., facts about my 
qualitative mental history, facts about how I have used an expression 
in the past, and facts about how I am disposed to use it in new 
cases.” His defense of the supervenience premise is much slighter, 
but it is nonetheless there; it comes out in his rejection of the idea 
that meaning facts could be sui generis (op. cit., pp. 51-3). Kripke’s 
argument is thus built on semantic supervenience as an essential 
premise; Putnam’s argument, by contrast, is undermined by it. 

JAMES VAN CLEVE 
Brown University 


% It is also possible to discern in Kripke a general argument to the effect that 
semantic facts cannot flow from natural facts because semantic facts have a nor- 
mativity that no natural facts could bestow. (See, e.g , the article by LePore and 
Loewer cited above ) Kripke urges this point chiefly against dispositional theories 
(see op. cit., pp. 11, 37), but the point could be generalized against any naturals 
tic theory of meaning or reference. An additional premise would obviously have 
to be that the normative never supervenes on the natural. Aa 
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BAYESIAN CONDITIONALIZATION RESOLVES 
POSITIVIST/REALIST DISPUTES 


ARTICULAR positivist/realist disputes are a recurrent phe- 

nomenon in virtually all empirical sciences and scientists reg- 

ularly shift their positions on such issues in the light of new 
observational evidence. Eventually, the weight of the evidence 
usually leads within the science in question to stable general consen- 
sus for or against any particular realist ontology that has been at 
issue. For example, such present general consensus has been 
reached in modern physics against the caloric hypothesis, in favor of 
the atomic theory, against ether theories, in favor of quarks, against 
local hidden-variable theories, in favor of quantized Lagrangian 
field theories, and so on. It is clear that, if we were able to set up 
convincing rational reconstructions of how the weight of evidence 
can logically induce such general agreements in science in one or 
other direction, such an achievement might go a long way toward 
clarifying the structure of the positivist/realist disputes that occur in 
general epistemology. 

Recent philosophers of science, in spite of having been provided 
by mathematicians with an apparently completely general recipe for 
any rational effect of evidence on theory choice, namely, the rule of 
probabilistic inference known as Bayesian conditionalization, have 
generally regarded that recipe as inapplicable to positivist/realist 
disputes, and indeed have often gone further and supposed that any 
probabilistic account of scientists’ beliefs in theories must necessar- 
ily systematically favor positivist positions. For example, that was 
Karl Popper’s main argument for rejecting probabilism and opting 
for realism and improbabilism, and it is Bas van Fraassen’s main 
argument for accepting positivism—he believes positivism is sup- 
ported by probabilism, which he accepts. 

In the following pages, I shall clear up this confusion by offering 
explicit Bayesian analyses of the rational weight of observational 
evidence in various positivist/realist disputes, thereby making it 
clear that the conversions from positivist to realist positions (and 
vice versa) which have occurred on so many occasions in the history 
of science have been warranted by valid probabilistic reasoning on 
the basis of observational evidence—notwithstanding the apparent 
theorems to the contrary which have been cited by philosophers of 
science. My examples will make it clear that such philosophers have 
conflated degrees of belief in quite different propositions. 

In the following pages, I shall use the rule of Bayesian conditional- 
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ization in the following form. Let H be a hypothesis or theory, let E 
be the observational evidence, let HH) be the subjective probability 
that the person in question ascribes to the theory H before the 
evidence E has come in, let HKE, H) be the subjective probability that 
before he knows the truth of E, he ascribes to the truth of E condi- 
tional on the truth of H, and let HE) be the corresponding subjec- 
tive probability that he initially ascribes to E. 


(KE) = AE, DHH) + PE, -Hp H) 
(where —H is the denial of the truth of H) 


Then the subjective probability that he should rationally ascribe to 
H after he discovers the truth of the observational claim(s) E is, 
according to this standard rule of probabilistic conditionalization: 


HH, E) = KE, H)AH)/PAE) 


In this formula, I shall substitute symbols representing various 
specific theoretical and observational claims for H and F, and I shall 
sometimes also need to subscript the ps in order to make it clear to 
which individual subjective probabilities, at some particular time, I 
am referring. 

I. ABSTRACT GENERAL ANALYSIS OF POSITIVIST/REALIST DISPUTES 
This section can be studied before or after the case studies that 
follow. In general, in each specific example of a positivist/realist 
dispute, I shall be concerned with four different substituents for H, 
namely, the detailed realist theory Tp, its observational consequence 
set Tp, the central realist ontological claim R, and the positivist 
denial of R, which I shall call P. 

Tp Will entail Tp, 80 p(Tp) will always and for everyone be greater 
than or equal to p{T,). Tz Will also entail R, so p(R) will always and 
for everyone be greater than or equal to p(T,). P is simply the 
negation of R, so, for everyone, p(P) = p(—-R) = 1 — AR). P (while 
always entailing — Tp) neither entails nor is entailed by Tp, however. 
Depending on the circumstances and the individual whose beliefs 
are at issue, p(P) may be less than or equal to or greater than p(Tp). 
This is not widely enough appreciated. 

The philosophical conflation of P and Tp, and hence of p(P) and 
~(Tp), bas in fact formed the greatest obstacle to any real under- 
standing of what is going on in positivist/realist disputes. In fact, 
observational evidence often, as we shall later see, lowers p{P) while 
raising p{Tp), or lowers (Tp) while raising p(P). Indeed, such sce- 
narios prove to be the most typical ones. 
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This should have been obvious from the history of science, 
namely, that the more success a realist theory has in its observational 
predictions, the more scientists will be inclined to accept its central 
ontological claims, i.e., p(T p) rises and p(P) falls, and the more it 
Tuns into trouble with its observational predictions, the more scien- 
tists will be inclined to call those central ontological claims in ques- 
tion, i.e., (Tp) falls and p(P) rises. I shall show that precisely such 
changes are required by ordinary applications of Bayesian condi- 
tionalization. 

In applying this formula, I shall freely ascribe degrees of belief/ 
subjective probabilities to scientific hypotheses and theories. In gen- 
eral, such hypotheses and theories will not be decidable assertions, 
so they could not be directly betted upon. There is no more reason, 
however, why we should insist on simple operational definitions for 
the theoretical parameters in epistemic theories than we should for 
the theoretical parameters in physical theories. Indeed, it seems to 
me a subjective fact that the degrees of belief we assign to decidable 
statements are nearly always in practice introspectively derived from 
those we assign to more general less decidable statements. Further- 
more, de Finetti’s representation theorem shows (pace de Finetti) 
that assigning degrees of belief to decidable statements (e.g., out- 
comes of finite numbers of coin tosses) often commits us to assign- 
ing degrees of belief to undecidable statements (e.g., degrees of bias 
of coins).! 

The only difficulty I recognize about assigning degrees of belief to 
scientific theories arises from the vagueness often inherent in such 
theories, e.g., where it is not specified in advance precisely what the 
intended scope of a particular such theory is, so that one can envis- 
age possible worlds, or indeed future scientific scenarios, in which it 
would be largely a matter of future convention whether we decided 
that the original theory had turned out to be true or false. But such 
problems, when they arise, can always be solved by further initial 
specification of the theory. 

My later worked examples will indeed presuppose that even the 
positivist assigns some small initial probability to R and to Ty. Here 
I would say that, even if the positivist claims on the basis of some 
theory of meaning that R is meaningless (actually this is practically 
never the case with real scientific examples, until philosophers start 
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sticking their oars in), it would still be altogether unreasonable for 
the positivist to assign a probability of 1 to the correctness of his 
favored theory of meaning. (For he would be crazy to offer us infi- 
nite odds that he will not himself abandon that theory after more 
careful analysis of its implications.) So his p(R) still should not come 
out as actually 0. In any case, anyone who wishes to do so could read 
all the negations in my later examples as indicating falsity-or- 
meaninglessness and the calculations would still go through. 

In any specific example, the central realist ontological thesis R 
itself is often initially rather vague and may become more precise as 
Tr is further articulated. This may lead to eventual identification of 
R and Tp, or, if Tp is itself developing internal paradoxes or run- 
ning into observational difficulties, to a fairly sharp later separation 
between R and Tp. Where R itself remains vague it is often best to 
identify belief in R with a belief in the not-yet-exhausted.-theoretical 
(and ultimately predictive) fruitfulness of a particular analogy. But 
in other cases a belief in R, which has become modified and quali- 
fied in the course of time as a result of further development of Tp, 
may itself eventually become indistinguishable from a belief in Tp. 

-As Tx is further articulated, Tp, the set of T,’s observational 
predictions, will in general grow. Also, Tp and hence Tp will usually 
be modified from time to time to improve the empirical fit of these 
predictions. The associated modifications in and further articulation 
of Tr, and correlated extensions of Tp, may lead to a progressive 
narrowing of the gap between Tp and Tp. That is, the initially obvi- 
ous surplus content of Tp, relative to Tp, may disappear in the 
course of time, having been progressively cashed by modifications 
and refinements of Tr and correlated extensions of Tp. In such a 
case, a belief in R may finally become indistinguishable from, and 
logically reducible to, a belief that all actual and all possible observa- 
tions would always satisfy a certain particular set of far-reaching 
mathematical constraints built into Tp. And the positivist rejection 
of R, and thus acceptance of P, would then become identical 
with skepticism as to the universal, timeless, and counterfactual- 
embracing validity of this particular set of far-reaching and inexor- 
able mathematical constraints on all possible observations. 

The disagreement between the realist and the positivist here 
would thus essentially reduce to an issue of how far one was entitled 
inductively to extrapolate from the hitherto actually performed ob- 
servations. Thus, it would simply be a matter of different degrees of 
belief in Tp. But these differences might of course only arise in 
relation to parts of Tp’s full content which were altogether remote 
from its practical content with respect to actual and technically feasi- 
ble observations. In general, a Tp will include predictions of the 
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observational outcomes along all branches of a vast tree of action 
choices, including all sorts of alternatives that are in practice mutu- 
ally exclusive, and including specifications of what would have been 
the observational outcomes of unperformed possible past actions. It 
is only thus in a highly rarified sense that I shall here suggest that 
positivist/realist disputes are in an ultimate analysis reducible to 
differences of degrees of belief in sets of observational claims. 

Furthermore, in these ultimate limiting cases Tp will typically be a 
theory on which we can obtain no finite grasp except via its Tp 
axiomatization. That is, we will know as mathematicians that all the 
theoretical terms we use are in principle eliminable, and actually 
observationally definable, and merely at the cost of a more compli- 
cated axiomatization. But this cost may be a completely prohibitive 
cost, not only for ourselves but possibly for any finite system. So, if 
we wish to believe in Tp, the only way to do this is to believe in Tp, 
and we will know from our mathematical metatheorems that this 
belief in Tp is in fact metaphysically innocent, we are not commit- 
ting ourselves to anything not in principle reformulable in positivis- 
tic language. So the realist is really just a positivist who is happier 
about far-reaching observational extrapolations. 

This is not the situation in lower-level positivist/realist disputes. 
At earlier stages in their history, realist theories Tz do have surplus 
content over their observational consequence sets, Tp. But this sur- 
plus gets cashed in terms of further observational consequences as 
the realist theory is further articulated. The positivist is generally 
more skeptical than the realist concerning the empirical scope and 
accuracy of the current Tp, and he certainly does not expect further 
observational successes as a result of further articulation of Tg; but 
the realist does; or the positivist expects fewer such further observa- 
tional successes than the realist does. These are the real scientific 
disputes between positivists and realists. They are not metaphysical 
disputes. Subsequent evidence favors one party or the other, and 
Bayesian rationality requires corresponding belief shifts. 

O HISTORICAL EXAMPLES 

Consider the case of early nineteenth-century realism concerning 
atoms and its early nineteenth-century positivist critics. Let the real- 
ist theory of atoms be Tz. Let the corresponding positivistic obser- 
vational theory, in this case the laws of constant and multiple pro- 
portions, be Tp. We note that Tp, in this as in all cases, goes well 
beyond the known experimental data. In fact, in this case Tp is a 
very far-reaching extrapolation from fragmentary and in some re- 
spects conflicting known experimental data. Tp entails Tp but Tp 
does not entail Tg. Tp could be true and Tp false. 

For example, although Tp, if it turns out to be true without ex- 
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ception, might be regarded as entailing some funny kind of quanti- 
zation rule governing the proportions of elements in compounds, 
that rule alone would certainly not entail any computable limit on 
the divisibility of matter of a given sort. Nor even if such a limit were 
computable, on the basis of further observational evidence and an 
extension of the theory, would Tp entail that the corresponding 
hypothetical smallest constituents of given macroscopic materials 
could be consistently localized relative to one another in ordinary 
three-dimensional space. Relative location in three-dimensional 
space might prove to be not even well-definable at the submicrosco- 
pic level in question. Nor according to Tp need it be the case that we 
could legitimately ask whether such hypothetical ultimate constitu- 
ents were moving, and if so how fast, and so on. Tr, on the other 
hand, would be false, even if Tp remained true, if the world were 
such that future observations would eventually generate irresolvable 
inconsistencies with the assumptions implicit in Tg that atoms have 
finite sizes, have locations relative to each other in space, and so on. 
Thus, while there may be a certain amount of vagueness in what Tp 
precisely entails, it certainly embodies inter alia an empirically 
stronger constraint on future observations than does Tp as here 
identified. Thus, at later stages in the development of atomic theory 
we shall need a new Tp that asserts more than the laws of constant 
and multiple proportions. For the moment, though, I shall restrict 
myself to the original version of Tp. 

For what follows, it is crucial to appreciate the following logical 
relations. Tz, the realist theory, entails R, the realist ontological 
position. R may also entail Tg, as is plausible in this historical exam- 
ple, so it does no harm provisionally to identify R and Tp. On the 
other hand, P, the positivist ontological position, does not entail Tp, 
nor does it entail the negation of Tp: to reject the existence of atoms 
one is committed neither to accepting nor to denying the validity of 
the laws of constant and multiple proportions. P is thus simply —R, 
and hence in this case, provisionally, identical with —Tp. (Any 
philosopher who has qualms here as to whether positivists merely 
assert the falsity of realist ontological claims or actually claim they 
are meaningless can simply read the symbol ‘—’ as ‘false or meaning- 
less’ without affecting the validity of the folowing arguments.) 

On the other hand, Tg does entail Tp. Indeed, even indepen- 
dently of our possibly over-simplified identification of R with Tp in 
this historical example, R itself here pretty clearty entails Tp. So in 
any case it follows (from a theorem of elementary probability theory) 
that Tp is at least as probable as, or more probable than, Tr. But 
nothing follows from that about the relative (subjective) probabili- 
ties of P and R, or of P and Tg. In particular, Tr can still be more 
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probable than —Tp, hence, a fortiori, R more probable than P. 
Thus, the Popper/van Fraassen inference from the relation between 
p(T) and p(Tp) to the conclusion that positivist ontological claims 
have always to be as or more probable than realist ontological claims 
is of course nonsense. . 

Now, let us subscript our ps according to whether they represent 
the degrees of belief of the realists, ppr( ), or those of their positivist 
opponents, pp). Let the realist at a given point in time assign a 
subjective probability of 0.6 to Tp: pr(Tp) = .6, that is, the realist 
thinks that it is a little more likely than not that Daltonian atoms 
really exist. Hence, 


Pr(Tp) = PAT r, Tr) TR) + pr(Tr, -Trp Tr) 


according to a standard identity of the probability calculus. Hence, 
Pr(Tp) = 1x.6 + pr(Tp, —Tp)x.4. What is the realist’s estimate of 
the only unknown here, namely, the subjective probability that he 
would assign to the truth of the laws of constant and multiple pro- 
portions were he to believe that Daltonian atoms did not really exist, 
thus merely on the basis of the direct observational evidence so far 
available? Let us for the purpose of discussion take this as 0.2. It 
would follow that pp(Tp) = 1x.6 + .2x.4 = .68. As Popper and van 
Fraassen and others have observed, the realist thus ascribes here a 
higher probability to the positivist version of the theory than to the 
realist version. Nevertheless, the realist here trivially assigns a higher 
probability to Tg than to —Tp and hence, a fortiori, to R than to 
—R, hence to R than to P. 

What of the positivist? Suppose at the same point in time he 
ascribes a subjective probability of only 0.2 to the existence of Dal- 
tonian atoms. Then pATp) = 1x.2 + .2x.8 = .36. Notice that the 
positivist and the realist are here assumed to agree on how probable 
the laws of constant and multiple proportions would be merely on 
the basis of the direct observational evidence. In this, though not in 
all, positivist/realist disputes, this assumption is historically plausi- 
ble. Notice also that .36 is smaller than .68, thus, given these particu- 
lar conjectural historical subjective probabilities, the positivist is not 
only less sure than the realist of the exact truth of the observational 
laws, namely, the laws of constant and multiple proportions, but in 
this particular case he thinks that they are more probably than not 
false (for example, that future evidence will produce counterexam- 
ples). This is again historically plausible for this example. 

What happens when further observational evidence for the truth 
of Tp is obtained? Well, the maximal further observational evidence 
that could be obtained would be observational evidence that 
brought the subjective probability of Tp arbitrarily close to 1. Let us 
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see what effect this would have on the degrees of belief of the realist 
and the positivist. What we need to calculate is thus, by Bayesian 
conditionalization, 


PalTm Tr) = PriT rm Tr)Pe(Tn)/Pa(Tp) = 1x.6/.68 = roughly .9 
and 
PAT m Tp) = PAT p, Tr)PAT e)/PrlTr) = 1x.2/.36 = roughly 6 


In other words, the realist is merely strengthened in his realist be- 
lief, but the positivist has actually converted to atomic realism. 

The reader is invited to rework these calculations for himself with 
other numerical inputs. He will find for example that, if the positiv- 
ist had ascribed an initial subjective probability of only .] to the 
truth of realism but .2 (as in our example) to the initial probability 
of the laws of constant and multiple proportions purely on the basis 
of the observational evidence, then he would never actually convert 
to realism purely on the basis of further observational evidence for 
those particular observational laws, though he would come close to 
it. On the other hand, if he had assigned a higher initial “purely 
observational probability,” say, .6, to those laws, he would need to 
have started with pp(Tp) close to .4 to have eventually converted to 
realism concerning atoms on the basis of further observational evi- 
dence for the truth of those observational laws. g 

The latter numerical example has also the interest that it illus- 
trates a situation where the positivist does, even initially, regard the 
observational laws as more likely to be true than not (though not as 
likely as the realist considers them) and he still will convert eventu- 
ally to realism, given sufficient further evidence for them. This 
shows that the accident in my original example that the positivist 
began by thinking the relevant observational laws were more likely 
than not false (which I regard as a plausible reconstruction of the 
actual historical situation with respect to this positivist/realist issue) 
is by no means essential to the possibility of subsequent rational 
conversion to a realist position. 

So far I have treated the laws of constant and multiple propor- 
tions as “purely observational.” That is of course an oversimplifica- 
tion. ‘‘Observationality” is always relative to the particular positiv- 
ist/realist dispute at issue. A more extreme positivist could certainly 
detect theory ladenness in these laws, and in relation to him we 
would have to reanalyse the situation with respect to a yet more 
positivistic observation language, and the corresponding “‘observa- 
tional” laws as formulated within that more restricted observational 
language. 
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It might seem from this example that the direction of conversion 
is always eventually from positivism to realism. That of course is not 
true. Not only would a refutation of the observational laws derivable 
from the realist theory refute that realist theory, leaving the positiv- 
ist position intact, but also observational evidence could accrue 
which would diminish the probability of those observational laws 
without actually refuting them. Indeed, this is what happened for a 
period after Dalton’s original publications: the observational laws in 
fact ran into trouble with alloys and this trouble was not cleared up 
until alloys were clearly distinguished from both compounds and 
mixtures. So for a time p(Tp) decreased for everyone, and probably 
enough—a good approximate calculation of the rational opinion 
change in question can be constructed simply by applying Jeffrey- 
conditionalization on a lowered value of pp(Tp)—for some realists 
concerning atoms briefly to reconvert to the positivist position. 

There also arose further observational difficulties for the realist 
position. Gay-Lussac’s observational laws on combining volumes of 
gases seemed, when conjoined with the atomic theory, to require the 
Avogadro-Ampére hypothesis. But that hypothesis seemed at the 
time very implausible. Quite apart from the difficulty of under- 
standing the required polyatomic rather than monatomic molecules 
of even elementary gases, the existing static atomic theories of tem- 
perature and pressure of gases failed to render the Avogadro- 
Ampère hypothesis intrinsically plausible; and such incipient kinetic 
theories of gas pressure as were then available suggested rather that 
gas pressure at a given temperature ought to increase directly with 
the masses of the molecules, because this would, for given speeds of 
molecules, increase the momentum transfer, and hence pressure, at 
the boundaries. Formally, with A as hypothesis, and G as Gay- 
Lussac’s observational laws, the crucial issue was whether p(A, Tp) 
was not perhaps even smaller than p(G, —T,), where the latter 
corresponded roughly to the estimated chance, independently of 
the atomic theory, of there being a further mysterious quantization 
rule corresponding to Gay-Lussac’s findings. At the time many ato- 
mists appear to have estimated the value of b(A, Tp) as so small that 
they were more inclined to regard Gay-Lussac’s original results as an 
experimental artifact than to welcome them as additional further 
support for Tp. 

Of course, nearly fifty years later, with the development of the 
modern kinetic theory, p(A, Tp) was drastically revised upward, and 
as a result, after the clarification of the chemical situation by Canni- 
zarTo, H(G, Tz) became much larger than p(G, —Tp), hence G now 
provided, precisely in accordance with Bayesian conditionalization, 
further strong evidence favoring realism concerning atoms. 
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The later predictive successes of structural atomic chemistry, and 
the increasing agreement between estimates of the size of atoms 
obtained from very different calculations based on very diverse 
classes of phenomena, provided further evidence E for which 
p(E, Tr) was very much greater than p(F, —Tpr), and hence, by 
Bayesian conditionalization, further reason to accept realism con- 
cerning atoms. 

In the last quarter of the nineteenth century, the remaining 
doubts concerned not so much whether matter took on a discrete 
structure at very small dimensions—even Ostwald seems to have 
accepted that already in the 1880s—but rather whether the vast 
velocities imposed by the kinetic theory on these smallest discrete 
parts were established beyond reasonable doubt by the observa- 
tional predictions to which they led. There were at least three difh- 
culties here: (i) some of the predictions (e.g., concerning specific 
heats) seemed either wrong or to require fantastic hypotheses con- 
cerning the dynamical structure of molecules; (ii) in the case of the 
major successful predictions of the molecular-kinetic theory, there 
was a strong suspicion—supported by authorities such as Ostwald, 
Dubem, and Planck—that most of the successful predictions could 
be rederived on the basis of purely thermodynamical considerations, 
hence that for these predictions p(E, —Tp) was not substantially 
different from p(E, Tp), where Tp is now the fully extended © 
molecular-kinetic theory; and (iii) early attempts to explain Brown- 
ian motion on the basis of the kinetic theory seemed to give quantita- 
tively completely wrong results, so the evidence of Brownian motion 
seemed at the time to count against rather than in favor of the 
kinetic theory. With B as the quantitative observational features of 
Brownian motion, it seemed that p(B, Tz) was substantially less than 
p(B, —Tp), hence B seemed to be rational probabilistic evidence 
against Tp. 

Problem (i) was cleared up in the early twentieth century with the 
development of the old quantum theory; (ii) was clarified as it gradu- 
ally became clear that the energeticists and thermodynamicists had 
been overly optimistic concerning how many of the predictive 
achievements of the kinetic-molecular theory they could in fact re- 
produce without adding arbitrary extra assumptions to their phe- 
nomenological theory; and (ii) was finally resolved by the theoretical 
work of Einstein and Smoluchowski and the experimental work of 
Perrin. The quantitative details of Brownian motion, B, thus eventu- 
ally became a further piece of evidence for which p(B, Tp) was 
finally recognized as close to unity and p(B, —T,) as very small 
indeed: thus, B constituted observational evidence with enormous 
evidential weight according to Bayesian conditionalization. It was 
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indeed sufficient to convert the earlier positivistic-skeptic Ostwald 
to the molecular-kinetic theory. 

We note that in the course of these developments, Tp, the positi- 
vistic reconstruction of the predictive content of the atomic theory, 
has to be constantly enlarged, and indeed has to have more and 
more quantization features built into it. At the same time, Tp, the 
realist theory of atoms, becomes refined (especially with the develop- 
ment of the quantum theory of the structure of matter) in such a 
way that atoms lose many of the features originally attributed to 
them on the basis of a naive analogy with ordinary macroscopic 
particles. The question in fact arises as to whether this progressive 
strengthening of Tp (which of course by the later twentieth century 
will have to include in its predictive content the results of x-ray 
crystallography, of electron microscopy, of “the making visible of 
individual atoms” using scattered laser light, and so on) and modifi- 
cation of Tp in such a way that its details become more and more 
tied to details of Tp, does not eventually lead to formal mathemati- 
cal equivalence between these two theories. 

Here we must never forget that Tp is a predictive observational 
theory that must contain among its predictions not only all the actu- 
ally confirmed observational predictions of Tp, but also all its actual 
and conditional future predictions, including expectations condi- 
tional on the development of future, as yet unthought-of instrumen- 
tal techniques for the further probing of the hypothetical submicro- 
scopic reality. Could it not be that the modern Tp is simply a more 
economical expression of the modern Tp? 

That there may no longer be a genuine logical gap here between 
the realist and the equivalent positivist theory is further suggested 
by the position of the only surviving antiatomist I have come across, 
namely, C.A. Truesdell. Truesdell’s position—expressed in conversa- 
tion, I do not know whether it is in print—is that the best modern 
atomic theory we have is in fact a quantized field theory of matter in 
which the number of atoms is merely a quantum number, and that 
the nearest classical equivalent to such a theory is not a discrete 
particle theory of matter but a continuous field theory of matter. 
Hence Truesdell maintains that it is really the sophisticated anti- 
atomists rather than the naive atomists who have been subsequently 
vindicated. Whatever the merits of this reading of history, it suggests 
that antiatomists and atomists now believe in essentially the same 
theory and that the difference between them has become at most 
terminological: Tp has been extended so as to include most of the 
extra content of the original Tg, and other initial expectations of Tz 
have been so qualified that the two positions seem no longer clearly 
distinguishable. 
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Surprising though it may seem, there are in fact a few rather clear 
examples of positivist/realist disputes where Tp seems not only to be 
entailed by, but also to entail, Tg. For example, in Mach’s reaxioma- 
tization of classical Newtonian mechanics, although masses are elimi- 
nated from his positivistic axiom system—which speaks instead of 
inverse ratios of mutually induced accelerations, which ratios are 
posited to be the same whether mediately or immediately arrived 
at—nevertheless the existence of masses, unique up to a choice of 
unit, is then trivially provable within his axiom system. (Patrick 
Suppes’ later axiomatization failed to yield this result because it 
assumed a funny observational language; see fn. 2.) The difference 
is that ‘mass’ does not occur as a primitive term in Mach’s positivistic 
formulation of the theory. While Mach’s T> is thus actually formally 
equivalent to Newton’s Tp here, the positivist ought to be less cer- 
tain of the truth of either than is the realist. In particular, the posi- 
tivist here can, and I would say must, take very seriously the possibil- 
ity that the fact that the ratios of accelerations in question turn out 
the same whether mediately or immediately arrived at may turn out 
to be true merely as some kind of first-order approximation, or to 
possess a class of exceptions: in which case realism concerning 
masses would be false: for the observational laws would fail to satisfy 
the mathematical conditions required for the existence of the mass 
functions. Thus, in this case the choice of axiomatization is related 
to the theorist’s suspicions as to where the theory is likely to 
break down. 

The realist over masses need not be absolutely confident that the 
orthodox theory of mechanics will never require future modifica- 
tion, but he does not expect it to require future modification of 
such a kind that masses will cease to be well-defined parameters; 
or, at the very least, the positivist considers this a much more 
likely eventuality than the realist does. Hence, at least in this re- 
spect, we expect pp(Tr) = pr(Tp) to be substantially less than 
Pr(Tr) = pr(Tp). It then follows by Bayesian conditionalization that 
observational evidence—e.g., further and further evidence that Tp 
(or the relevant part of Tp) really is exactly correct and not merely a 
good first-order approximation in our actual universe—will shift 
the positivist’s beliefs in the direction of realism. For Tp, or the 
relevant part of it, is itself here construable as observational evi- 
dence O such that p(O, Tx) is greater than p(O, —Tp). 

It is historically likely that Mach’s positivism concerning masses is 
of a less strong sort than the positivism (shared by Mach) that we 
have previously considered concerning atoms. That is, it seems to 
me plausible here to reconstruct Mach’s position as assigning to Tp 

and Tp here a subjective probability of, say, 0.6, compared with the 
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realists’ assignment of, say, 0.99. The difference is one of degree of 
confidence, Mach’s degree of confidence not being such that he is 
willing to choose a formulation of the theory which cannot easily be 
modified should the observational laws required for the mathemati- 
cal existence of masses run into experimental difficulties. The tran- 
sition from Mach’s relative positivism to relative realism in this case 
will probably thus occur not at the .5 confidence level but at some 
vague higher point, say, around .9, at which it seems to become 
reasonable no longer to allow, within the formalism, for a rather 
unlikely kind of possible future modification. Clearly, increasing ob- 
servational confidence in the observational theory Tp will directly 
shift Mach’s position here, since p(T) in this case equals p(Tp).” 

There are in fact other examples of the same relation between 
positivist and realist positions. For example, Poincaré once reax- 
iomatized Newtonian gravitational theory taking relative rather than 
absolute accelerations as a primitive. The axiom system required 
turned out slightly more complicated than the usual one, but of 
course one could prove within it that there existed a set of reference 
frames such that accelerations relative to that set of frames satisfied 
precisely the usual axioms governing absolute accelerations. In 
other words, in this case too, Tp turned out not merely to be en- 
tailed by, but also to entail Tp. In fact, Poincaré seems to have been 
satisfied enough of the truth of the relevant observational content 
of the orthodox realist theory here not to argue that his more posi- 
tivistic reaxiomatization yielded a preferred choice of axiom 
system." 


? Readers who have been persuaded by the arguments of Suppes and others 
that Machian-style reaxiomatizations fail to eliminate the theoretical entities they 
are supposed to eliminate are referred to my ‘The Eliminability of Masses and 
Forces in Newtonian Particle Mechanics: Suppes Reconmdered,” in the British 
Journal of the Philosophy of Saence, xxvn, 1 (1977): 55-7. One can add to the 
objections presented in detail there, the objection that simple Padoa’s principle 
arguments do not go through in our actual universe because it lacks the neceasary 
perfect symmetries exploited in those arguments. Sneed’s conjectures concerni 
Ramsey-noneliminability also do not in my view stand up to careful 
scrutmy The best neo-Machian positivistic axiomatization of Newtonian mechan- 
ics is to be found in the first part of G W. Mackey’s Mathematical Foundations of 
Quantum Mechanics (New York: Benjamin, 1963), and its only slight awkwardness 
m ending up with masses which could be different in different directions is easily 
avoided by unposing rotational invariance on any laws from the beginning. The 
observational definability of ordinary theoretical terms is enureły clear m this 
presentation 

3 Further examples of positivistic reaxiomatizations of standard physical the- 
ories, where the existence and uniqueness of the eliminated theoretical entities 1s 
rederived within the positivistic re-axiomatization, can be found in Robin Giles’ 
studies of the mathematical foundations of thermodynamics and of quantum 
theory. 
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A further instructive example of the same kind is the relation 
between Einstein’s original positivistic formulation of special relativ- 
ity theory based on the Lorentz transformation equations and the 
realist formulation of that theory in terms of Minkowski space-time 
geometry. The entities taken to represent underlying physical reality 
in the space-time geometrical formulation are simply the invariants 
of the transformation group relating reference systems, i.e., of the 
transformation group that is taken as fundamental in the more posi- 
tivistic formulation of the theory. But clearly the truth of these 
transformation laws mathematically entails the existence of the 
space-time geometrical invariants in question. So here again Tp en- 
tails Tp as well as vice versa. 

Those more positivistic theorists who regard the transformation 
equations as more fundamental than the corresponding space-time 
geometrical formulation (a formulation in which observer’s-frame- 
of-reference dependence is no longer a feature of any of the primi- 
tive concepts) probably do so because they envisage a future in 
which those transformation equations will be seen as merely part of 
a wider group of transformations to which the dynamical laws of our 
universe are subject and which will require a deeper theory than 
space-time geometry for its explanation. Such a theory would even- 
tually set limits on the validity of space-time geometry and of those 
transformations, and perhaps also on the observer-independence of 
the properties and concepts we need to take as fundamental. Real- 
ists concerning Minkowski space-time geometry, on the other hand, 
are more confident that essentially geometrical explanations of the 
Lorentz transformations will remain fundamental in future theory, 
and refuse to regard special relativity theory as providing any evi- 
dence for expecting any ineliminable observer-dependence as a fea- 
ture of future fundamental theories in physics. It seems thus that 
while both groups of theorists will already admit that, given the 
success of general relativity, both Tp and Tp must be regarded here 
as of only local validity, the positivists are less confident of both of 
these, for they suspect that Tp is only a small part, valid to a high 
approximation, of a future more global theory. Hence the situation 
here is rather similar to the situation with respect to Mach’s positiv- 
ism concerning masses in classical mechanics. 

These examples show that it is possible to make sense of positivist- 
realist disputes even in cases where the realist theory is in fact logi- 
cally and mathematically formally equivalent to the positivist theory. 
The issue here between the positivists and realists is in fact a matter 
of degree of confidence in the theory in question. We can identify 
this theory indifferently with Tp or Tp since here these mutually 
entail each other, so for both realist and positivist, p(Tp) = p(T). It 
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is p(T) (= po(Tp)) which is less than pa(Tp) (= pr(Ta)). Contrary 
to what one might expect, pp(Tp) is not greater than pp(Tp) nor is 
Pr(TR) greater than pp(Tp). 

The question therefore seriously arises in the light of these exam- 
ples whether it may not also be in the case of modern atomism that, 
as a result of convergence in the content of Tp and Tp, we have 
reached a point in which the same mutual mathematical entailment 
holds as in the cases of Machian (and neo-Machian) and Newtonian 
formulations of classical mechanics, Poincaréan and Newtonian for- 
mulations of classical gravitation, and Einsteinian and Minkowskian 
formulations of special relativity. 

Precisely the same question arises as to the relations between phe- 
nomenalism and realism concerning the external world. May not a 
full phenomenalist Tp, were we able to write it down, not merely be 
entailed by, but logically and mathematically actually entail, the 
corresponding Tp? The logical relationship could be like that be- 
tween a large table of correlation coefficients and the small number 
of factors resulting from a complete factor analysis of that table. 
Clearly this latter example shows that a mere mathematically more 
economical redescription of the data can be viewed as having consid- 
erable explanatory value, even though this is neither a Hempelian 
nor a Popperian explanation, since the explanans has no greater 
scope than the explanandum. 

But there is an even more direct mathematical analogy between 
the epistemological phenomenalism-realism issue and the Einstein- 
ian and Minkowskian formulations of special relativity. In both cases 
we seem, on the one hand, to have a phenomenological theory that 
describes and relates all possible perspectives and, on the other 
hand, a realist theory that posits an objective multidimensional ge- 
ometry that is in fact built up precisely out of the invariants mathe- 
matically generated by the corresponding perspectival theory. For- 
mally, one expects the two theories to be equivalent and mathemati- 
cally to entail one another. This is perhaps not obvious in the case of 
phenomenalism relative to the external world or positivism with re- 
spect to modern atomic theory merely because no one has taken the 
trouble formally to construct the Tps in question, for they would so 
obviously be clumsy formulations compared with the corresponding 
realist formulations. But there is every reason to suppose that the 
set of all possible perspectival views of a group of objects (whether 
or not supplemented by a topology associated with possible trajec- 
tories of observers through this set) would possess a natural topol- 
ogy that was three-dimensional and that the ordinary realist descrip- 
tion of the shapes and sizes and relations of those objects in three- 
dimensional space would simply be derivable as the mathematical 
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structure of the mathematical invariants underlying the correspond- 
ing perspectival group of transformations. Ordinary realist claims 
would thus be recovered as mathematical theorems within this phe- 
nomenalist description. (And extension of this phenomenalist data 
set to inchide proprioceptive sensations would not change the logi- 
cal situation, but merely generate multiple derivability of the realist 
theorems.) 

If these extrapolations from mathematically better understood 
situations are correct, the difference between a positivist and a real- 
ist in general epistemology is not so much a matter of their having 
different expectations as in their differing in the confidence with 
which they hold these expectations. This confidence can be raised or 
lowered, we have seen, in the ordinary way on the basis of favorable 
or unfavorable observational evidence, by the rule of Bayesian con- 
ditionalization. Thus, positivist/realist disputes do not differ essen- 
tially in their logical structure from any other more obviously purely 
empirical disputes in science. Bishop Berkeley is simply less confi- 
dent than the realist that his re-entering-the-study-like experiences 
will always be followed by as-if-perceiving-his-desk appearances. 

The positivist in any particular dispute may have perfectly reputa- 
ble reasons for having less trust than the realist in the very far- 
reaching observational extrapolations from the actual observational 
data required by that particular realist position. For example, the 
phenomenalist may have little confidence in the ultimate success of 
physiological explanations of dreaming and of mystical experiences 
and, for reasons of his own, may consider an ultimate mentalistic 
ontology more plausible than an ultimate physicalistic ontology: he 
is thus less committed than the realist to the view that all future 
observational data will conform to a set of laws whose invariants are 
those required by common-sense physical realism. The success of 
microrealist theories of a macrorealist world constitutes of course 
further evidence in favor of macrorealism, for the additional law- 
like constraints that such success imposes on the observational data 
can only be regarded as having arisen by mathematical accident 
from the point of view of the phenomenalist or solipsist. Hence 
p(E, R) is greater than p(E, P) for this evidence E. Berkeley had in 
fact doubts about the internal consistency of the microphysical the- 
ories of his day, which doubts enabled him to attach less weight to 
this argument than it might otherwise have carried. 

While Berkeley’s doubts concerning the microrealist physics of his 
day have been more or less resolved by subsequent developments, 
new doubts have arisen in the present century with respect to our 
latest fundamental theories. Realism requires that all true observa- 


378 THE JOURNAL OF PHILOSOPHY 


tional laws be consistent with and explicable on the basis of an 
observer-independent reality. This imposes a quite strong mathemat- 
ical constraint on those laws. The laws of the quantum theory appar- 
ently fail to satisfy this constraint, as can be seen from a consider- 
ation of either the no-hidden-variable-theorems or the quantum- 
mechanical measurement paradox. Prima facie, such a situation thus 
provides observational evidence against an observer-independent re- 
ality, hence against realism and favoring phenomenalism or (taking 
into account the problems involved in giving a relativistically- 
invariant, observer’s-frame-of-reference-independent account of 
wave-packet collapse in measurement) even solipsism. There are a 
number of theoretical options, however, none of them very plausi- 
ble, which remain open to the microrealist as a possible escape from 
the relevant no-go theorems. So, while the evidence from the 
strange observational laws embodied in the quantum theory 
weakens the case for realism—for these observational laws O, p(O, 
R) is apparently much less than p(O, P}—and while some theoretical 
physicists have apparently converted on the strength of this evi- 
dence to versions of phenomenalism or solipsism, many others re- 
main unconvinced, presumably because their prior confidence in 
realism was initially higher, and they still hope to find an acceptable 
interpretation of the quantum theory which frees it from commit- 
ment to a fundamentally observer-dependent reality. For them, 
p(R, O) still remains greater than p(—R, O). 

I have argued so far as if the realist position, R, can always be 
identified with the realist theory Tz, although the positivist position, 
P, cannot generally be identified with the positivistic theory Tp. This 
simplification obviously requires certain qualifications. In the quan- 
tum mechanical case, we do not have any convincing Tp available: 
the realist merely hopes that such a theory will become available in 
the future. And even in the case of atomism, we saw earlier how Tz 
became changed and extended in the course of time, incorporating 
the kinetic theory and later specific theories of atomic structure; so 
that, if we restrict R here merely to realism concerning the existence 
of atoms, R clearly can no longer be directly identified with Tp. 
Indeed, it would already obviously have been an oversimplification 
to identify R with a belief in every detail of Dalton’s original Tg. So 
in these cases, while R does not really entail Tp, Tp still entails R, so 
if Tg becomes subjectively more probable than not by Bayesian con- 
ditionalization, a fortiori so does R. 

There are also cases where, surprisingly, Tp is either unavailable, 
or, where it exists, is less probable than not, given the truth of R, 
and where, contrary to the situation in our original examples, cer- 
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tain evidence for Tp counts against rather than for the truth of R. 
Quantum-mechanical realism, where Tp is usually at best programa- 
tic, may be an example of this, since at least some realist programs 
here actually require that some of the observational predictions of 
the standard quantum theory be false. For example, the experimen- 
tal tests of the Bell inequalities excluded certain realist options that 
previously had their defenders. This situation where R has to be 
sharply distinguished from Tp arises acutely in situations where the 
realist theory Tp is a makeshift devised to reconcile certain originally 
unexpected aspects of a successful observational theory Tp with the 
ontology required by R. 

A simpler example than quantum-theoretical realism is the neo- 
Lorentzian ether-theoretic realist interpretation of special relativity 
theory. Here, Tx is indeed set up so as to yield the observational 
equivalence of all inertial reference frames demanded by Einstein’s 
Tp. But R itself, realism with respect to the ether, requires the 
existence of a theoretically preferred frame, and is thus inconsistent 
with a belief in the universal validity of the observational laws em- 
bodied in Tp, and hence, a fortiori, requires that the present Tp, 
which is set up so as to yield precisely Tp, can only be some sort of 
approximately valid theory. So in this case more and more observa- 
tional evidence for Tp, and hence for Tp, actually favors the posi- 
tivist’s rather than the realist’s theoretical position with respect to 
the ether This is the opposite situation to the one we met with in the 
case of the atomic theory and in the case of ordinary common-sense 
realism. Formally the situation is easy to understand, however. If O 
is new observational evidence casting doubt on the equivalence of all 
inertial frames, p(O, Tp) = p(O, Tp) is very small and approximately 
equal to p(O, P) = p(O, —R), whereas p(O, R) is substantially larger; 
hence p(R, O) is greater than p(P, O), and Bayesian conditionaliza- 
tion would here yield a shift of opinions in the direction favoring the 
ether-theoretic realists. Conversely, further evidence favoring the 
observational equivalence of all inertial frames—for example, new 
evidence discrediting that particular aforementioned O or its 
theoretical interpretation—would cause a rational opinion shift 
in the direction of the positivist position with respect to the neo- 
Lorentzian ether. 

Similar situations arise with respect to a few other positivist/real- 
ist disputes in physics: for example, one form of realism concerning 
vector potentials in electromagnetism would insist not only on their 
mathematical existence (which is provable within the corresponding 
positivist theory) but on their uniqueness modulo a smaller class of 
transformations than the full class of gauge transformations. But, 
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like the uniqueness of the neo-Lorentzian ether frame, this ontologi- 
cal conjecture has to be stated as an expected mathematical prop- 
erty of some future modified or extended observational theory: it 
cannot simply be imposed on the current Tz, since the Tp which 
that entails yields uniqueness only modulo the full class of gauge 
transformations: one could thus not impose further mathematical 
uniqueness within the current Tg without generating a mathemati- 
cal contradiction within that Tx. 

A failure to distinguish P (= —R) from Tp has confused most 
philosophical discussions of positivism and realism. But in these last 
examples, identification of P and Tp would be innocuous; philosoph- 
ical confusion could easily be produced, however, by a failure to 
distinguish R (= —P) and Ty. 

Attempts have been made in some of the physics literature to 
illuminate the peculiar features of some positivist/realist disputes by 
attempting to draw a distinction between the purely mathematical 
existence and real physical existence of the disputed entities or to 
reconcile their mathematical nonuniqueness with their supposed 
physical uniqueness. In my view, this only leads to obscurantism; for 
all these disputes can be fully clarified without introducing such 
doubtful, and probably ultimately unintelligible, distinctions; it is 
enough to take into account theorists’ full future-observational ex- 
pectations and the degrees of confidence they associate with these. 

The same kinds of formal and evidential considerations apply, I 
believe, to all positivist/realist disputes in science and epistemology. 
It makes little ultimate difference whether the realist ontology in 
dispute is initially introduced on the basis of an analogy, as in the 
case of atoms, the ether, or other minds, or merely on the basis of a 
mathematically more economical reformulation of the observational 
theory based on exploitation of the higher-order-invariants struc- 
ture of the original observational laws, as in the case of forces, po- 
tentials, wave functions, three-dimensional spatial geometry, or (3 + 
1}-dimensional space-time geometry. 

In the former cases, Tp is initially certainly stronger than the 
initial set of observational laws Tp that Tp was introduced to ex- 
plain. But this distinction can be expected to evaporate in the course 
of time with the introduction of empirical qualifications on the origi- 
nal analogy and extensions of Tp to incorporate and render precise 
all the more vague and indirect observational expectations implicit 
in the original Tg. For example, even in the case of the other-minds 
issue, Tg must allow for empirical differences between my mind and 
those of others, and Tp must be extended to include physiological as 
well as behavioral evidence, even including the possibility of future 
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electrophysiological gadgetry allowing rather direct access to an- 
other’s thoughts and feelings. And here the ultimate success of a 
detailed functionalist theory of the mind rendering Tp and Tp ulti- 
mately fully equivalent cannot be altogether discounted. Even if this 
is not achieved, there remains no obvious bound on the posterior 
probability of the realist theory which can be reached here by succes- 
sive Bayesian conditionalization on new behavioral and neurophysio- 
logical evidence. It is only with an unchangeable and unextendable 
Tp and an unqualifiable Tg that p(Tp)/p(Tp) could remain fixed, 
and less than 1, in such a way as to set a bound on the maximum 
posterior probability of Tp. I can find no plausible examples of 
positivist/realist disputes involving such a constraint. 

The widespread philosophical prejudice to the effect that realist 
conclusions can never be mathematically derived from positivistic 
axiom systems is not only open to numerous mathematical counter- 
examples, as we have already seen, but seems to me to be based on a 
false analogy going back at least as far as Plato. The analogy is that 
of the projection of the real three-dimensional world onto a single 
flat two-dimensional surface, the world of observations. But is it not 
obvious that, if we had such projections onto arbitrarily many dif- 
ferent such surfaces, the three-dimensional reality might be 
uniquely mathematically reconstructable? A similar mathematical 
prejudgment was made in medieval astronomy when it was assumed 
by positivistic astronomers that astronomical observations could 
only determine the projections of the relative planetary positions 
onto the background sphere provided by the fixed stars, hence that, 
though the latitudes and longitudes of the planets were empirically 
meaningful, their altitudes (i.e., relative distances from the earth) 
were not. But here it only needed the realization that astronomical 
observations can be made from different points on the earth’s sur- 
face for it to become geometrically clear that the planetary altitudes 
are in principle fully determinable (indeed, mathematically multiply 
overdetermined) by the parallax effects at the observational level. 
The full set of observational projections is more than sufficient to 
determine the actual locations of the planets in physical space de- 
manded by the realist theory. 

In the history of science there are many similar examples of realist 
conclusions turning out, surprisingly enough, to be logically deriv- 
able from positivist premises. A celebrated example is Heisenberg’s 
matrix mechanics as contrasted with Schrédinger’s wave mechanics. 
The former was explicitly set up as a strongly positivistic theory in 
which the only observables were essentially the frequencies, intensi- 
ties, and polarizations of spectral lines. Schrédinger’s theory, on the 
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other hand, was set up as a realist theory describing the distribution 
of electronic mass and charge within the atom through its wave 
functions for stationary states. To Heisenberg’s considerable sur- 
prise, the mathematical existence of the Schrédinger wave functions 
proved to be logically derivable within his own explicitly positiv- 
istic theory. (This example should not be confused with the later 
positivist-realist issues concerning the quantum theory. Heisen- 
berg’s original positivistic program that led to the setting up of 
matrix mechanics was much more extreme—more restricted in its 
observation language—than any later ‘‘Copenhagen’’ positivistic 
position.) 

The more usual situation is, of course, where the realist theory 
comes first and only subsequently is more positivistically reaxioma- 
tized. No mathematical physicist is then surprised when the exis- 
tence, and uniqueness modulo the relevant groups of transforma- 
tions, of the functions and parameters whose existence the realists 
merely posited proves mathematically derivable within the more 
positivistic axiom systems. 

Realist ontologies, I thus suggest, probably ultimately simply 
posit the existence of mathematical objects (functions, invariants, 
and so on) whose existence would be actually mathematically re- 
quired by a suitable set of positivistic observational laws, provided 
those full observational laws did indeed rigorously possess the exact 
mathematical structure required for the relevant existence and uni- 
queness theorems. Sufficient observational evidence will then (as 
the history of science teaches us) resolve, by ordinary Bayesian 
probabilistic inference, any particular positivist/realist dispute to 
any desired level of subjective confidence. I thus conclude that posi- 
tivist/realist disputes are ultimately not metaphysical disputes at all 
but rather. ultimately disputes as to the legitimacy of certain far- 
reaching observational extrapolations from the existing observa- 
tional data, and that such disputes are thus resolvable (to any de- 
sired degree of convergence-of-opinion) by sufficient relevant ob- 
servational evidence, in accordance with the ordinary rules of 
probabilistic reasoning. 

JON DORLING 
University of Amsterdam 
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Dear Carnap, Dean Van: The Quine-Carnap Correspondence and 
Related Work. W. V. QUINE and RUDOLF CARNAP. Richard Creath, 
ed. xv + 484 p. Berkeley: University of California Press, 1991. 
Cloth $34.95. 


W. V. Quine was twenty-four and Rudolf Carnap forty-one when 
they met in a hospital in Vienna at the end of 1932. That is perhaps 
the best age, and age difference, for mutually productive collabora- 
tion between teacher and pupil in philosophy. Ludwig Wittgenstein 
and Bertrand Russell were similarly related during their nch ex- 
changes in Cambridge twenty years earlier. It might be interesting to 
see if other examples bear out the generalization. Quine followed 
Carnap back to Prague and had what he has described as his ‘‘first 
experience of sustained intellectual engagement with anyone of an 
older generation, let alone a great man.” By then he had already 
studied at the Harvard of Sheffer and C. I. Lewis, and had written 
his Ph.D. thesis with A. N. Whitehead. In Prague he eagerly listened 
to Carnap’s lectures on what was soon to become The Logical Syn- 
tax of Language, and on off days Carnap graciously invited him to 
his home for long discussions of the new work. Carrying the message 
back across the Atlantic, Quine gave three lectures at Harvard in 
1934 to explain to a slightly wider world the basis of Carnap’s con- 
ception of philosophy—or whatever was valid in what used to be 
philosophy—as syntax. 

Those lectures have been part of the lore of Emerson Hall for 
many years, and it is very good now to see them in print. They 
appear here as the first fifty-odd pages of this thick volume, fol- 
lowed by the fruits of a correspondence between the two philoso- 
phers lasting almost thirty-eight years until Carnap’s death in Sep- 
tember 1970. The book concludes with Quine’s “Homage to Rudolf 
Carnap,” presented to the Philosophy of Science Association a 
month later. It is the only previously published material here. 
Richard Creath contributes a long introduction to explain to the 
uninitiated what the philosophical “battle” (as he calls it) between 
the two friends was all about. 

These are not men to bare their souls to each other. There is 
obviously affection and great respect between them. They exchange 
expressions of genuine best wishes, warm regards, and occasionally 
love, with wives and children included. There are a few lively notes 
from Carnap’s wife, Ina. The fortunes and whereabouts of the sev- 
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eral Quine children through the years are warmly communicated. 
There is much news, especially on Quine’s side, of travel. There are 
signs of academic maneuvering, especially in the earlier years when 
Quine is trying to get Carnap a job in the United States. His first 
choice is naturally Harvard, and he gives Carnap the names of ‘“‘al- 
lies” among influential university figures to whom he should send 
“all future easily understood publications.” Quine later asks Carnap 
to drop a word in the right places when he finds his own depart- 
ment’s recommendation of his promotion rejected and thinks of 
leaving Harvard. Carnap obliges, but it turns out to be unnecessary. 

There are mentions of health from time to time, almost exclu- 
sively physical, and mainly in connection with Carnap’s chronic back 
troubles. Reports of meetings with mutual friends, mostly on Car- 
nap’s part, are largely restricted to what philosophical issues were 
discussed. A strong ideological solidarity which usually goes without 
saying surfaces in Quine’s invocation of “the hard-headed, anti-mys- 
tical philosophical temper which all of us share” and in his reference 
at one point to “the enemy camp so to speak.” The bonds are real, 
but remain largely unstated, or perhaps unexplored. Aside from a 
one-sentence announcement of his separation from his first wife, 
the long pain of Quine’s divorce recorded in his autobiography is 
not mentioned here. Carnap hears indirectly about Quine’s second 
marriage and sends him belated warm wishes and the unelaborated 
reassurance that “from my own experience I am a great believer in 
second marriages.” There is a brief note of sympathy from Quine 
after Ina’s tragic suicide much later in 1964, but nothing to indicate 
that Carnap ever said or wrote anything to Quine about it. This is 
primarily a philosophical correspondence. 

Much of it consists of detailed responses of one author to the 
work of the other, or to the other’s reactions to earlier responses. 
When Carnap is still in Europe they even exchange copies of gram- 
matical and stylistic corrections made to letters written in the recipi- 
ent’s but not the author’s native language; all are duly recorded 
here. Philosophically, the two go through each other’s work line by 
line. Marginalia on the letters are also reproduced here, and a sub- 
sequent letter usually makes the point at greater length. In a small 
but reassuring number of cases, the objection is quickly conceded: a 
mistake was made. To follow the more complex responses for full 
scholarly understanding of the history of these matters the reader 
would also need at hand a copy of the text under discussion. Pub- 
lished writings of Carnap and Quine would not always suffice. Aside 
from differences of pagination, they are typically the outcome of 
these exchanges, not in every respect the object of them. 
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There is heavy concentration throughout on what look like ques- 
tions of terminology. It starts in 1934 with Carnap seeking advice 
about English equivalents of crucial logical terms for the translation 
of Die logische Syntax der Sprache. By 1939 he is sending long ques- 
tionnaires not only to Quine but to Alonzo Church, Ernest Nagel, 
Carl Hempel, Helmer, and Alfred Tarski, trying to get general 
agreement on a terminology for semantics and on English terms for 
a translation of his Abriss der Logistik. In 1943 he sends Quine and 
others another long questionnaire on the best terminology for 
various expressions and what he calls their “meanings,” e.g., propo- 
sitions, properties, concepts, individual concepts, and the like. This 
raises the suspicion that in some cases perhaps more than terminol- 
ogy has been at stake all along. Quine attempts to identify substan- 
tive disagreements between them on just these notions, but Carnap 
in all reasonableness does not “quite understand” what he has in 
mind; they look to hint like different “proposals” for constructing a 
semantic theory. Very close study of these frank and careful ex- 
changes would be needed to locate and identify the real sources of 
philosophical difference between Carnap and Quine. There is more 
direct give-and-take than in their finished published writings, but 
concessions made here and there for the sake of argument might 
also at times obscure differences that turn out to be important. 

The heart of the book as it bears on this question is the rich one 
hundred pages which begin with a letter early in 1943, after a gap of 
two years with only a single postcard to show for it. Quine writes 
from Washington, where he is commuting daily from the outskirts to 
full-time work in Navy radio intelligence while apparently philoso- 
phizing by night. It is the time of the publication of his “Notes on 
Existence and Necessity.” He reports that reflection on Carnap’s 
recent Introduction to Semantics has led him to greater clarity on 
the points “crucial to semantics” at which he dissents from it. An 
earlier draft had been the subject of discussion in 1940 during that 
momentous period when Carnap and Tarski were both visiting Har- 
vard (along with Russell, whose William James Lectures of that year 
became An Inquiry into Meaning and Truth). Carnap replies from 
El Paso and then from Santa Fe where he has been flat on his ailing 
back for many months while working continuously on what was to 
become Meaning and Necessity. 

He first provides a “comparison of terminology” with expressions 
in the ‘Quine’ column paired off with their corresponding expres- 
sions in the ‘Carnap’ column, and after suitable translation finds 
their views on semantics to be “much closer to each other” than it 
had seemed. Quine replies with a masterly two-part letter occupying 


386 THE JOURNAL OF PHILOSOPHY 


twenty of these pages in which he explains and defends his objec- 
tions in “Notes on Existence and Necessity” and responds to Car- 
nap’s explanation of their differences about “meaning” and “‘desig- 
nation.” In the meantime Carnap has found what he thinks is a way 
to avoid all the puzzles Quine has raised about his approach to 
semantics by restricting himself now to “the concept of extension 
and intension” alone. When the new two-hundred-and-fifty-page 
typescript of that material reaches him, Quine reports that he 
“found it very interesting. By this I mean that I sat down with it on 
coming home from work and didn’t let go of it, except for a brief 
meal, until after proper bedtime. By that time J had read two-thirds 
of it: it had a grip on me like a mystery thriller. This evening I 
finished it.” He confesses to needing more time for the fundamental 
issues to “ferment” in him, but in what remains of that wartime 
evening he is still able to jot down eight more pages of “critical 
remarks.” The subsequent letters back and forth lead up to a state- 
ment of Quine’s objections which Carnap asks him to supply for 
inclusion in the published version of Meaning and Necessity. 

What is finally most impressive about this correspondence, 
beyond the high quality of thought, is the energy and concentration 
sustained throughout, and the desire and above all the will on the 
part of both philosophers to get things right. In reply to a pub- 
lisher’s inquiry in 1961 about the advisability of translating and re- 
publishing Carnap’s Der logische Aufbau der Welt, Quine, who 
greatly admires the book, cannot resist adding that “at important 
points it is simply wrong: objectively so, quite apart from questions 
of philosophical party.” Philosophy is conducted in these letters in 
that same objective spirit. That is not something it would ever have 
occurred to either of these correspondents to point out to the 
other, or to request of him. But it must have been a continuing 
source of the great but unspoken respect they so clearly felt for 
each other. 

BARRY STROUD 

University of California/Berkeley 
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METAPHYSICAL PLURALISM* 


avid Hume is the St. Francis of modern metaphysics, the 

patron saint of ontological ascetics. The devout Humean 

takes the pinnacle of metaphysical virtue to be a world in 
which the only facts are the mundane first-order physical facts about 
how things actually are—‘‘a vast mosaic of local matters of particu- 
lar fact, just one little thing and then another,” as David Lewis’ puts 
it. Hume’s own ontological austerity derives in part from his empiri- 
cism, in part from the great medieval Franciscan metaphysician, 
William of Ockham, and in part no doubt from the canny inclina- 
tions of his native land. Whatever its original sources, however, the 
doctrine remains enormously influential. The Humean ideal contin- 
ues to exert great force in contemporary philosophy. 

Like prosperous Franciscans, however, many metaphysicians who 
pay lip service to the virtues of ontological economy have strayed 
from the true path. The Hume world has frequently been judged 
too cramped. Some lapsed Humeans extend the bare Humean 
structure to fit in modal facts; some extend it to fit in subjective 
experience or intentional mental states; others to fit in moral facts; 
and so on. The common theme is that a respectable metaphysics 
simply cannot survive on the bare regime that Hume preacribes. 

Now, the galling thing about a prosperous Franciscan is not the 
fact that he is not suffering the discomforts of poverty, or not this as 
such. It is his moral inconsistency, the fact that he claims virtue 
while failing to live according to his own professed conception of 
the virtuous life. There are two ways to challenge this inconsistency. 


E APENE Go Ana Hoe Ge cL Go E S 
and particularly to John Burgess, John l, John Hawthorne, Mi lis 
Michael, Philip Pettit, Michael Schepanski, and Mark Walker, for many comments 
on earlier versions 

' Philosophical Papers: Volume IT (New York: Oxford, 1986), p. 1x. 
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One is to demonstrate to the Franciscan that he could make do with 
less, and hence establish that by his own lights he is not leading the 
pious life. This is in effect the strategy of those who dispute the need 
for particular extensions to the Hume world, arguing that the pur- 

such extensions serve can be met in some less extravagant 
manner.” The alternative deflationary strategy is more subtle. It is to 
accept the case for the extensions, for the Franciscan’s additional 
consumption, but to point out that the consequence of accepting 
this is to undermine the minimalist conception of a virtuous lifestyle. 
In accepting that the Franciscan is fully entitled to satisfaction of the 
normal human appetites, in other words, we undercut his claim to 
abnormal piety. 

It may not be obvious that there is any analogous move in the 
metaphysical case. In this paper I want to show that there is, and to 
suggest that this has profound consequences for our understanding 
of a range of contemporary metaphysical debates. Roughly, its ef- 
fect is to undercut the distinction between various non-Humean 
forms of metaphysical realism and something akin to a Wittgenstein- 
ian linguistic pluralism. To the extent the distinction can be drawn, 
moreover, the latter is the default position. So, not only is this form 
of pluralism an important and widely neglected option in a range of 
contemporary metaphysical debates; it actually has claim to be the 
pre-eminent option. In a sense I shall explain, it is the philosophical 
geodesic, the course from which no one is entitled to depart without 
good reason. The paper thus presents a challenge to the lapsed 
Humeans of contemporary metaphysics: embrace worldly pluralism, 
or return to the pure faith, for there is no virtuous middle way! 

The paper is in five main sections. Section I identifies our target 
species of pluralism, and distinguishes it from a more common spe- 
cies. In section II, I consider this target species in relation to a 
variety of antipluralist rivals, including in particular the monist doc- 
trine that seems to be the intended position of the lapsed Humeans; 
and I draw attention to the central issue of the paper, namely, as to 
whether there is actually a tenable distinction between this monist 
position and our target pluralism. Section III argues that despite the 
avowedly ontological nature of their concerns, these lapsed 
Humeans cannot avoid reliance on a certain semantic distinction— 
essentially, the distinction between descriptive and nondescriptive 
uses of language—and points out that this puts them at at least a 
prima facie disadvantage compared to the pluralists, who need no 


2 A recent example is Bas van Fraassen’s critique of contemporary non- 
Humean accounts of modality in Laws and Symmetry (New York: Oxford, 1990). 
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such distinction. Section IV then outlines a case for thinking that 
the required distinction cannot be drawn, and draws attention to 
some connections between the resulting sort of pluralism and cer- 
tain other recent approaches to the same metaphysical topics. And 
in section V, I illustrate the advantages and character of such a 
pluralism with reference to the issue between realists and their in- 
strumentalist critics; perhaps surprisingly, this rather Wittgenstein- 
ian pluralism turns out to provide an especially secure brand of 


realism. 
L TWO KINDS OF PHILOSOPHICAL PLURALISM 

One familiar kind of philosophical pluralism is exemplified by W. V. 
Quine’s brand of ontological relativity, and perhaps in a different 
way by other forms of scientific relativism. Here the plurality con- 
sists in the possible existence of a range of alternative scientific 
worldviews, each empirically adequate to more or less the same de- 
gree, and none, even in principle, having privileged claim to provide 
a “truer” description of the world. This form of pluralism itself 
comes in several varieties or subspecies, of course. The distinction 
between Quine’s variety and Kuhnian scientific relativism may well 
be a deep one, for example.’ For present purposes, however, what 
matters is something that these views have in common, namely, that 
the plurality they admit involves a range of different ways of doing 
the same kind of thing, of performing the same linguistic task. There 
may be many equally valid possible scientific worldviews, but all of 
them are scientific worldviews, and in that sense are on the same 
level of linguistic activity. In other words, this is what might appro- 
priately be called horizontal pluralism. 

Horizontal pluralism is far from confined to this scientific level. It 
is a product of certain kinds of relativism, and so may be found 
wherever these flourish. In ethics, for example, it is the familiar 
thesis that there is a range of equally coherent moral viewpoints, 
none objectively superior to any other. Why is this a case of horizon- 
tal pluralism? Again, because the plurality it envisages is confined to 
a single linguistic plane or level. Different moral systems are all 
nevertheless moral systems. They have something in common in vir- 
tue of which they may be counted to be different ways of perform- 
ing the same linguistic task. It may be a very nice question how this 
something in common is to be properly characterized, but it must 
have an answer, if relativism is not to degenerate into the trivial 
point that the same words may mean different things for different 


people. 


* On this see G. D. Romanos, Quine and Analytic Philosophy (Cambridge: 
MIT, 1988), ch 8. 
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If these are cases of horizontal pluralism, what would be a vertical 
pluralism? It would be the view that philosophy should recognize an 
irreducible plurality of kinds of discourse—the moral as well as the 
scientific, for example. This is the species of pluralism with which we 
are going to be most concerned. (I shall mainly use the term dis- 
course pluralism.) I want to show that it provides a natural if not 
initially a congenial home for the lapsed Humean. 

The plurality of forms of discourse, or “language games,” is a 
prominent theme in the later Wittgenstein: “We remain uncon- 
scious of the prodigious diversity of all the everyday language-games 
because the clothing of our language makes everything alike.’’* 
There also seems to be a strong element of discourse pluralism in 
the American pragmatist tradition, of which Nelson Goodman and 
Richard Rorty are the most prominent recent representatives. True, 
Goodman and Rorty’s pluralism is not exclusively vertical. It also 
contains a strong horizontal or relativistic element. The fact that the 
vertical component is significant in its own right is manifest, how- 
ever, in the contrast between the positions of Goodman or Rorty, on 
the one hand, and Quine, on the other. Quine at times seems close 
to a purely horizontal pluralism, to the view that all factual discourse 
is either eliminable or reducible to physical discourse. This view 
acknowledges that there might be alternative physical 
discourses—‘‘alternative physical theories, insusceptible to adjudi- 
cation,” as Quine® puts it—but says that there is nothing else at any 
other level. And certainly Quine rejects Goodman’s proposal to ad- 
mit on an equal footing a multiplicity of further “world versions,” 
such as those of art and music, saying that in his view “‘this sequence 
of worlds or versions founders in absurdity” (ibid.,-pp. 97-8). 

More accurately, however, Quine himself is nonphysicalist to the 
extent of accepting the existence of certain abstract objects, such as 
classes and numbers. This is not to say that he intends to be a dis- 
course pluralist. On the contrary, he is critical of the proposal to 
regard the acceptance of such objects as in some sense radically 
unlike the acceptance of physical objects: 


There are philosophers who stoutly maintain that ‘true’ said of logical 
or mathematical laws and ‘true’ said of weather predictions or suspects’ 
confessions are two uses of an ambiguous term ‘true’. There are philo- 
sophers who stoutly maintain that ‘exists’ said of numbers, classes and 
the like and ‘exists’ said of material objects are two uses of an ambigu- 


4 : . 
emg (rg Investigations, 3rd English ed. (Cambridge: Blackwell, 1968), p. 
224; also §23-4. 


5 Quine, Theories and Things (Cambridge: Harvard, 1981), p. 98 
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ous term ‘exists’. What mainly baffles me is the stoutness of their main- 
tenance. What can they possibly count as evidence? Why not view ‘true’ 
as unambiguous but very general, and recognize the difference be- 
tween true logical laws and true confessions as a difference merely 
between logical laws and confessions? And correspondingly for ex- 
istence?* 


Later I want to argue that this point backfires. Discourse pluralism 
is the default position in this debate, and does not need to be de- 
fended by appealing to a claimed ambiguity in ‘true’ or ‘exists’. On 
the contrary, it is the discourse pluralist’s opponent who needs to 
appeal to some suitably “thick” or substantial unity in these notions 
—and Quine’s preferred thin notions are not up to the task. Hence 
I want to suggest that Quine himself might best be recast as a dis- 
course pluralist about abstract objects. He thus exemplifies what I 
shall argue to be the predicament of many would-be non-Humean 
realists in contemporary philosophy. (The striking thing about 
Quine’s case is that his predicament stems from his own minimalism 
about truth and existence.) 

The first task is to mark off discourse pluralism from a range of 
other ways of dealing with the same philosophical topics. 

II. DISCOURSE PLURALISM AND ITS RIVALS 

As a distinctive philosophical doctrine, discourse pluralism is per- 
haps best understood in terms of what it denies, in terms of the 
contrasts it tries to establish with other philosophical treatments of 
problematic topics. In this section, I describe four such contrasts. I 
distinguish a pluralist treatment of a philosophical topic from four 
other approaches: reductionism, two forms of irrealism, and lastly a 
view I call additive monism, which is the intended position of non- 
Humean metaphysicians. (Later our main interest will be in the ques- 
tion whether an additive monist can resist reconstitution as a dis- 
course pluralist.) 

To establish the general framework, consider a problematic topic 
against the background of an unproblematic topic—morality 
against the background of natural science, say. Assume for simplic- 
ity that the status of the background topic is not at issue. Our first 
group of distinctions rest on different combinations of answers to 
two questions about the status of the problematic topic. First ques- 
tion: Is its (purported) subject matter distinct from that of the back- 
ground discourse? Second question: Is the problematic topic fully 
legitimate, again by whatever standards may properly be applied to 
the background discourse? (What is meant by ‘legitimate’ here will 


° Word and Object (Cambridge: MIT, 1960), $27. 
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Ontologically autonomous? 
Yes No 
Legitimate? Reductionism 
Yes No 


Unttying principle avallable? Irrealigm ——> Factual Intent? 


Yes No Yes No 
Additive Discourse Eliminativism Nonfactualism 
monism pturalism 


Figure 1: A discourse status quiz. 


become clearer as we proceed.) In terms of our example then: Is the 
subject matter of moral discourse distinct from that of natural 
science?” And is it a fully legitimate discourse, or does it in some way 
fall short? 

Discourse pluralism answers “yes” to both questions, and hence 
may be distinguished from views which answer “no” to one or other 
(and also from the less interesting view, which I shall not mention 
further, which answers “no” to both). The first contrast is therefore 
with reductionism, which concedes autonomy in order to save legiti- 
macy. (We are at the first level in the tree shown in figure 1.) The 
reductionist agrees with the discourse pluralist about the legitimacy 
of moral discourse, but seeks to secure it by denying the ontological 


7 There has been much discussion as to whether there are autonomous levels of 
explanation in science. Those who argue that there are such levels commonly 
msist that, nevertheless, physics has ontological prionty. Functionalists about the 
mental will often profess to ontological physicalism, for example, though claimi 
Be ee pone Gene ot a or (rata specu 
arrangements of) ph entities or properties. For present I shall 
assume that this ontological criterion is capable of bearing the weignt, and hence 
that such a view does count as a form of reductionism in the sense of figure 1 (in 
amas ee doino Ir D arden like tee Van alin 
then I the defenders of multiple levels of description, like 1 am calling 
lapsed Humeans, would be hard pressed to distinguish their view from a discourse 
pluralism about the scientific hierarchy. 
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distinctness of its subject matter. On the contrary, the reductionist 
argues, morality is a branch of natural science—a fact that is ob- 
scured, in practice, simply by our usual ignorance of the appropriate 
identities between moral facts and natural facts. For the reductionist 
moral discourse is legitimate because it actually is natural discourse, 
albeit disguised natural discourse.® A pluralist wants both autonomy 
and legitimacy, whereas a reductionist is prepared to sacrifice the 
former in the interests of the latter. (The reductionist may not view 
it as a sacrifice, of course.) 

Our next contrast is with two views that make the alternative sacri- 
fice, conceding ground on legitimacy in order to preserve auton- 
omy. They concede ground in very different ways, and we shall need 
to treat them separately. The fact that they both do so in some sense 
is, however, a reason to deal with them under a common heading. 

Trrealism thus comes in two main forms. The first is eliminativ- 
ism, the approach famously exemplified in J. L. Mackie’s® error 
theory of moral discourse. In effect, the eliminativist holds that a 
problematic area of discourse tries to be legitimate, as well as auton- 
omous; but that it fails to connect with anything in reality, and so is 
systematically false. The problematic topic is thus legitimate in in- 
tent, but fails in the execution of that intent. Mackie notwithstand- 
ing, eliminativism has not been particularly popular among moral 
philosophers. There are more familiar cases elsewhere, however. It 
has recently become a prominent approach to intentional psychol- 
ogy, for example. And Quine has long been seen as an eliminativist 
about meaning (and hence at least in some respects about psy- 
chology). 

The second form of irrealism—nonfactualism, as I shall call it— 
is much more popular. It is the approach that preserves the auton- 
omy of disputed topics by giving ground on their factual character. 
So, whereas the eliminativist holds that moral discourse tries to be 
factual but fails to connect, the nonfactualist denies that moral 
claims are ever intended to be factual claims. They do not fail to 
connect, for their linguistic role is not to attempt to connect. What 
it is instead is a matter of some interest, of course. In the moral case, 
the traditional options are the expressive or emotivist view, namely, 
that moral judgments express certain sorts of evaluative psychologi- 
cal attitudes; and the prescriptivist view, which treats a moral asser- 
tion as something approaching a command. 


® Bear in mind that the relevant notion of reduction is an ontological one; see 
the ious fn. 
° Ethics: Inventing Right and Wrong (Harmondsworth: Penguin, 1977). 
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Nonfactualism is a very common doctrine. Apart from the moral 
case, the approach is also well-known in application to aesthetic 
judgments, mathematical statements, theoretical sentences in 
science, secondary qualities, knowledge claims, psychology more 
generally, meaning ascriptions, indicative and subjunctive condi- 
tionals, and probabilistic, causal, and modal judgments. (No doubt I 
have missed some.) Not always well-appreciated are some of the 
basic philosophical commitments of any such position. It is clear 
that such a view is committed to the existence of a significant dis- 
tinction between factual and nonfactual uses of language. Perhaps 
not so obvious is the obligation that the nonfactualist incurs as a 
result of this, namely, to account for the fact that nonfactual uses of 
language can present themselves as factual uses, at least superfi- 
cially. 

The contemporary philosopher who has done most to acknowl- 
edge and to try to meet this obligation is Simon Blackburn. !° Draw- 
ing on Hume’s idea that we project our attitudes and prejudices 
onto the world, and so see it as populated by seeming facts of our 
own construction, Blackburn has argued that the nonfactualist can 
explain our conversing as if there really were such facts. He has 
tried to show that the projectivist is entitled to a notion of truth, and 
to the other trappings of a realist linguistic practice. Thus, projecti- 
vism supports quasi realism, as Blackburn calls it. Couched in these 
terms, the nonfactualist’s concessions concerning the legitimacy of a 
disputed topic are now very muted. To be sure, moral discourse (or 
whatever) is not really factual; but it has and is entitled to all the 
trappings of factuality, all the appearances of legitimacy. So, al- 
though on this view there is only one real world, only one realm of 
genuine facts, our language quite properly works as if there were 
many.! If this is not already discourse pluralism, then what sepa- 
rates it is the availability of a substantial distinction between factual 
and nonfactual uses of language. If that goes, nonfactualism is no 
longer a distinct alternative to discourse pluralism. 

So much for the distinction between discourse pluralism and the 
various views that differ from it in denying either the autonomy or 
the legitimacy of a problematic area of discourse. We now turn to a 
contrast not marked by a disagreement on either of these issues, 


10 See in particular his Spreading the Word (New York Oxford, 1984) 

11 Why not say “as if this realm were more inclusive than it actually is,” so that 
the alternative to nonfactualism becomes additive monism? Here I anticipate a 
little—we shall see that the availability of a substantial distinction between 
and nonfactual uses of language turns out to be a requirement for the additive 
monist, as much as for the nonfactualist. 
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namely, the distinction between discourse pluralism and the doc- 
trine I call additive monism. Additive monism agrees with discourse 
pluralism in rejecting reductionism and both forms of irrealism, 
accepting that multiple domains of discourse may each be autono- 
mous and yet fully legitimate. The disagreement is only about how 
these separate domains are to be construed. The additive monist 
regards them as subdomains of a single universe of facts—not a 
single physical universe, presumably, for that would be to concede 
the game to a physicalist reductionism, but single in some sense. 
What distinguishes additive monism from discourse pluralism is the 
claim that there is something that unifies the various autonomous 
discourses; what distinguishes it from reductive physicalism, the 
claim that whatever this unifying principle is, it is not that all facts 
are ultimately physical facts. 

It is natural to wonder whether there a real distinction between 
discourse pluralism and additive monism, or whether the apparent 
issue rests on an empty metaphor. And if there is not a real distinc- 
tion, how is the single resulting position best described? These ne- 
glected questions are of enormous importance to a wide range of 
current philosophical debates. For additive monism seems the in- 
tended home of the lapsed Humeans with whom we began—of con- 
temporary non-Humean approaches to such topics as diverse as 
causation, modality, the mental, abstract objects, morality, and so 
on. On the face of it, these accounts argue that there are more kinds 
of facts in the world than the Hume world admits—facts about 
causal relations, over and above facts about constant conjunction, 
for example. The proponents of these views do not think of them- 
selves as Wittgensteinian pluralists, of course. But it is not clear that 
there is conceptual space for any other position. The additive monist 
needs to show that something unifies the various autonomous dis- 
courses. Failing this, pluralism is the default option—what the 
would-be monist must fall back to if the distinction cannot be main- 
tained. Thus, I want to argue that it is much harder not to be a 
Wittgensteinian pluralist than many people have assumed. Since a 
pluralism of this kind explicitly abandons Humean metaphysical 
minimalism, its effect here is deflationary—it undermines the 
Humean conception of metaphysical virtue. The issue turns on the 
availability of a unifying principle. 

What might the additive monist appeal to at this point? 

OI. A PRINCIPLED MONISM? 
A first thought might be that supervenience will do the trick. Per- 
haps additive monism corresponds to the view that a problematic 
topic is supervenient on a nonproblematic background (though not 
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reducible to it); and discourse pluralism to the more extreme ver- 
sion of autonomy involved in denying both reducibility and super- 
venience between discourses. A little reflection shows that this will 
not do the job, however. There are counterexamples in both direc- 
tions. Non-Humean metaphysicians commonly deny that facts about 
laws, causation, and the like are supervenient on first-order matters 
of particular fact.’? Conversely, it is far from clear that a pluralist 
need deny supervenience. A nice counterexample may be extracted 
from Blackburn’s discussion of moral supervenience.’* Blackburn 
argues that his projectivist quasi realism not only accounts for moral 
supervenience, but is especially well-adapted to doing so. It is true 
that quasi realism is not a form of discourse pluralism. But I think it 
differs mainly in assuming that some stronger form of realism is 
appropriate elsewhere, for (at least some) nonmoral topics. So we 
would have a form of discourse pluralism if we abandoned this as- 
sumption, and thus extended quasi realism “all the way down.” At 
the same time, we would still be entitled to Blackburn’s account of 
moral supervenience, for this rests simply on the idea that moral 
discourse projects evaluative attitudes. Hence it seems that super- 
venience is independent of the issue between discourse pluralism 
and additive monism. 

With supervenience out of the way, the next suggestion may be 
that additive monism is primarily a metaphysical doctrine, whereas 
discourse pluralism is primarily linguistic. The monist maintains that 
the autonomous domains of facts are all part of a single universe or 
metaphysical totality; whereas the pluralist seems to be denying the 
unity of factual language, opting instead for the view that a variety 
of different uses of language share certain superficial characteristics 
—Wittgenstein’s common clothing of the diversity of language 
games. (What are these characteristics? More on this in a moment.) 
But although this certainly distinguishes the two doctrines in terms 
of their natural characterizations, it does not exclude the possibility 
that they come to the same thing. After all, the pluralist may well 
take the view that metaphysics itself is a manifestation of the superfi- 
cial clothing of language, and derives its apparent unity from this 
very source. Thus, if metaphysics is essentially a concern with ontol- 


12 See David Armstrong, What ıs a Law of Nature (New York: Cambridge, 
1988), for example. It is my contention that these views would be better explicitly 
recast in terms of discourse pluralism, but clearly they are not intended as such. 

13 See “Moral Realism,” in ity and Moral Reasontng, Casey, ed. (New 
York: Methuen, 1971), pp. 101-24; sading the Word; and “Su ce 
Revisited,” in Exercises in Analysts: by Students of Casimir Lewy, Yan 
Hacking, ed. (New York: Cambridge, 1985), pp. 47-67. 
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ogy, with ‘‘what there is,” and this issue is seen in Quinean terms, 
then metaphysics and grammar become inseparable. Ontology is a 
matter of quantification, and this is precisely the sort of linguistic 
feature that the pluralist might take to be an element of the superfi- 
cial clothing of language, serving similar grammatical purposes in 
discourses whose underlying functions in language are widely var- 
ied. (Analogous remarks would apply to truth.) 

Monists might object at this point that they have a more substan- 
tial ontological principle in mind, and that their dispute with dis- 
course pluralism turns largely on the question of our entitlement to 
some such notion. I think that this is on the right lines, but that it 
leaves out an important part of the story. The dispute cannot be 
entirely ontological, but must have a semantic or linguistic dimen- 
sion. For the metaphysical view requires that there be a nontrivial 
sense in which the various discourses to which it applies are all 
serving the same linguistic function. It requires that they are all 
descriptive in the same sense, that they share a common goal of 
truth, a common concern to portray the facts as they are. It is not 
enough for the monist that there be a unified world out there; it is 
also crucial that within each disputed part of language, statements 
stand in the same relation to the relevant part of the single world. 
Otherwise, monism is trivial: it is easy to find a unified world to 
which every use of language relates in some sense. The monist re- 
quires that it always be the same sense. So monism needs a common 
semantic theory, as well as a unified world. 

The monist might simply begin at this semantic level, of course, 
professing a concern with truth, the facts, or some such thing (again 
claiming to construe this semantic notion in a substantial way, in- 
compatible with the pluralist’s suggestion that these concepts belong 
to the superficial common layer of language). My point, however, is 
that even if the commitment does not begin at this semantic level, it 
must get there in the end. In effect, the monist needs a distinction 
between factual discourse—the kind that does or is intended to 
describe a part of a metaphysical totality—and nonfactual dis- 
course, the kind that is not so intended." 

This obligation may seem to fall as much on the discourse pluralist 
as on the additive monist. I think there is an important asymmetry, 
however—the onus here lies predominantly with the monist. For 


“ Strictly, a monist might consistently hold that all discourse is factual (a posi- 
tion ado e.g, by Lewis in “General Semantics,” in Philosophical Papers: 
Volume I (New York: Oxford, 1983), pp. 198-232). What matters is not the 
factual/nonfactual distinction as such, but the characterization of the factual it- 
self 


398 THE JOURNAL OF PHILOSOPHY 


diversity is obvious, being guaranteed by difference of subject mat- 
ter (once reductionism is rejected). It is the monist’s unity that calls 
for substantial argument. So, not only does additive monism turn 
out to involve a linguistic doctrine after all, its monism requiring or 
entailing a linguistic characterization; but it also turns out to be a 
significantly more committed linguistic doctrine than discourse plu- 
ralism. The monist must hold that the unity of factual discourse is 
more than skin deep (or clothing deep, in Wittgenstein’s metaphor). 
Whereas the pluralist can be content with an easy going linguistic 
“‘multifunctionalism’’—with the view that under their common syn- 
tactic skin the roles of the various discourses in our lives may be as 
different as say those of our various internal organs.’* 

Additive monists thus incur a significant debt in the coin of se- 
mantic theory: they owe us a substantial characterization of factual 
discourse. (We have already noted that the same is true of nonfac- 
tualism.) I want to exploit this point to argue for the attractions of 
discourse pluralism over additive monism, or for that matter over its 
other nonreductionist rivals. In effect, I want to suggest that the 
debt thus incurred turns out to be impossible to repay in any of the 
acceptable currencies. We could accept payment in the debt’s own 
currency, accepting the fact/nonfact distinction as an uncashed 
primitive. Ordinary scientific economy recommends instead that we 
give preference to a view that needs no such primitive. Pluralism 
and perhaps a physicalist reductionism emerge as the economical 
alternatives.’® 

This line of argument depends on a claim that many readers may 
find surprising, namely, that there is not a readily available and 
well-founded distinction between factual and nonfactual uses of lan- 
guage. I have defended this claim at length elsewhere,’’ and in the 
next section I sketch some of the main points of that argument, and 
some related advantages of discourse pluralism. I think the claim 
should really not be surprising to a contemporary philosophical au- 
dience, however, for in a sense it is a natural consequence of 
Quinean skepticism about meaning. Factuality is a semantic prop- 
erty, if anything. It is the property of “‘statementhood,” or that 
property which is shared by all assertoric utterances. Thus, it is a 
particular component of meaning, and Quine’s general skepticism 

1 These also exhibit striking superficial sumlarities, looking to the untrained 
eye like so much raw meat. Who could have guessed that the ki was really a 
filter, the brait a duinker; ete P The piualin gukersang ihat the Amions oi 
language might exhibit a similar diversity and Pa f 

Physicalism might after all ar apes E R orm of discourse plural- 
ism, namely, as the view that in fact is one autonomous and legitimate 
discourse, though there could have been many. 

17 In Facts and the Function of Truth (Cambridge Blackwell, 1988). 
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about meaning might already have led us to recognize the possibility 
that there is no such property, in any substantial sense.!8 

IV. TRUTH AND FACTUALITY 
I have said that discourse pluralism and additive monism are distin- 
guished by the fact that the former makes no appeal to—indeed, 
explicitly rejects—the assumption that there is a more than superfi- 
cial category of fact-stating or descriptive uses of language. It may 
seem perverse to count this a point in the pluralist’s favor. For is it 
not obvious in what such a category consists? Fact-stating utterances 
are just those which have genuine truth conditions, those which 
have assertoric force, those which express genuine beliefs about that 
world. Surely these notions are unproblematic? 

The pluralist’s response to this challenge is to draw attention to 
the possibility that these common-sense intuitions might rely on 
nothing more than the superficial features of language, the clothing 
that covers a more fundamental diversity. Are the notions of truth, 
fact, assertion, belief, and so on foundational categories, inevitably 
central to any theoretical account of our use of language? Or are 
they mere products of language, categories thrown up by language 
itself, and not therefore presupposed by a proper explanatory 
theory of language? As a first step, the pluralist asks us simply to 
acknowledge that the latter answer is at least a conceptual possibility 
—that common sense is powerless to exclude it. Thus, an appeal to 
common sense is ruled out by the consideration that common usage 
is the very object of theoretical inquiry here. All sides are agreed 
that ordinary usage exhibits a superficial unity between the dis- 
courses in question. The issue is whether this superficial unity is 
more than skin deep. 

Perhaps surprisingly, it is the pluralist who has the better of the 
naturalistic high ground in this dispute. The issue concerns the 
structure of a complete naturalistic theory of language use. 
Roughly, it is the question whether such a naturalistic theory reveals 
a structural feature of language that may be regarded as constituting 
a fact-stating/non-fact-stating distinction. This does not mean that 
the pluralist must concede that all linguistic notions are ultimately 
naturalistic (which would be to embrace a form of reductionism that 


'* The odd thing is that Quine himself never seems to notice that his skepticism 
has this consequence. As we noted at the end of section II, he takes his minimalist 
intuitions about truth to support the view that there is a single category of de- 
scriptive discourse, contra those who would take an instrumentalist view of talk of 
abstract objects. But he fails to notice that this is the sort of “victory” which we 
achieve by sweeping the from the board. The default position is a kind of 
pluralism—not the no ist’s kind, but the Wittgensteinian doctrine that 
superficial unity of our talk of truth masks wide differences in the functional roles 
of discourses or “forms of life.” 
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the pluralist is in general keen to avoid). The pluralist simply wants 
us to recognize the possibility that what can be said about language 
in naturalistic terms might reveal no trace of a substantial factual/ 
nonfactual distinction. As a naturalistic question, this not the sort of 
thing that can be decided a priori. So discourse pluralism is ulti- 
mately an empirical doctrine, albeit a highly theoretical one, and 
one that concerns the linguistic part of the natural world. 

The onus lies with the additive monist to show not only that there 
is a substantial property of factuality underlying ordinary usage, but 
also that it has a particular distribution. The monist wants to show 
that the disputed discourses are unified by a common concern, such 
as to “depict the world.” The discourse pluralist is not denying that 
scientific utterances and moral utterances are alike in being mean- 
ingful speech acts, but simply that there is any more substantial 
sense in which they are both “‘statements of fact,” or “descriptions _ 
of how things are.” So a plausible strategy for the pluralist is to try 
to argue that none of the usual attempts to characterize such a 
property is capable of settling the issue of distribution; or in particu- 
lar of excluding the trivializing possibility that all discourse is factual 
discourse—i.e., that ‘‘statementhood” is a property guaranteed to 
be possessed by any utterance whatsoever. If a proposed analysis of 
“statementhood” cannot exclude even this possibility, it is unlikely 
to convince us that the monist has a defensible claim concerning the 
factual status of the discourses of interest. 

This skeptical attempt to trivialize any proposed analysis of the 
notion of ‘‘statementhood” is the strategy I adopted in Facts and 
the Function of Truth. To illustrate, the most common suggestion is 
that factual discourse is distinguished by its relation to the notion of 
truth. Genuinely factual utterances are said to be those which are 
genuinely truth-bearing, or genuinely “aimed at truth,” or some 
such. As several writers have pointed out, however, the notion of 
truth involved had better be capable of doing the work. One notion 
which obviously will not do the work is a redundancy or disquota- 
tional notion, which depends on nothing more than the standard 
equivalence principle: the fact that for any sentence P, ‘P’ and ‘P is 
true’ seem in some strong sense to say the same thing.’® So, if truth 
is to be invoked at this point, we need what Bernard Williams” 
called a substantial theory of truth. I argue in Facts and the Function 
of Truth that there is no such theory to be had. The contenders 
inevitably fail on one of two counts: either they are like the disquo- 


19 As we have already noted, Quine’s minimalist views about truth fall into this 
category. 
D “Consistency and Realism,” in Problems of the Self (New York: Cambridge, 
1973), pp 187-206, at p. 202 
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tational theory, in applying equally well to any sort of discourse; or 
they already assume a fact-stating/non-fact-stating distinction, in 
some other form. 

One such ‘alternative relies on the psychological distinction be- 
tween beliefs and other kinds of propositional attitudes. This pro- 
posal originates in the kind of emotivism which grounds itself on 
Humean moral psychology, drawing a contrast between descriptive 
or belief-expressing language, on the one hand, and expressive or 
evaluative attitude-expressing language, on the other. The emotivist 
treatment of moral discourse has provided a model for parallel 
treatments of other topics. In the process it has given rise, at least 
implicitly, to a general theory of the nature of the distinction be- 
tween factual and nonfactual language: to the view that factual lan- 
guage expresses belief, whereas nonfactual language expresses atti- 
tudes of other kinds. The trouble with this theory is that there is very 
little prospect of drawing the required psychological distinction, in 
advance of some other solution to the problem it is supposed to 
address.”! We need a substantial notion of belief, but we will not get 
that until we have a substantial way of drawing the same distinction 
somewhere else. t 

The skeptical attempt to drive a conventional doctrine around a 
logical circle has a famous paradigm in modern philosophy, in 
Quine’s own attack on the analytic/synthetic distinction. And as we 
have noted, the conclusion itself seems one that Quine might well 
have endorsed. Quine’s irrealism about meaning seems very much at 
home with the view that there is no substantial unity to the factual, 
descriptive, or truth-bearing part of language as a whole; no single 
such semantic category, in any substantial sense. Indeed, this view 
might be seen as simply doing for semantics what Quine himself 
does for ontology: insisting that philosophy has no privileged 
second-order vantage point, but must rather make do with the deliv- 
erances of the best first-order theories, taken at face value. 

At any rate, the effect of these skeptical arguments is to underline 
the economical advantages of discourse pluralism, by showing that a 


Cee E ce E roi aa class of beliefs. 
See, in icular, Smith, “The Humean ‘Theory of Motivation,” Mind, 
xcvi (1987): 36-61, ip Pettit, ““Humeans, Ant}Humeans and Motivation,” 


Mind, xcvu (1988): 530-3; Lewis, “Desire as Belief,” Mind, xcvi (1988): 323 
32; John Collins, “Belief, Desire and Revision,” Mind, xcvm (1988) 3338—42; 
Michael Smith, “On Humeans, Anti-Humeans and Motivation: A to Pettit,” 
Mind, xcvo (1988). 589-95; my “Defending Denre-as-Belief,” Mind, XCVI 
(1989): 119-27, and Philip Pettit and my ‘Bare Functional Desire,” Analysis, 
XLIx (1989): 162-9. 


402 THE JOURNAL OF PHILOSOPHY 


useful characterization of factual discourse is not as readily to hand 
as we have tended to assume. But it might be objected that pluralism 
incurs costs of its own. For one thing, it owes us an account of what 
it regards as the superficial unity of language—for example, of the 
fact that all discourses apparently avail themselves of the same indic- 
ative syntax, and of the notion of truth. For another thing, we are 
entitled to be told in what the underlying diversity is supposed to 
consist—in what dimension the different discourses enjoy their ver- 
tical separations. Related to the latter point is the important ques- 
tion as to how we set the limits to pluralistic tolerance. Clearly there 
are some conceptual conflicts that call for a less tolerant response, 
for simply discarding one of the alternatives. The obvious thought is 
that this is appropriate when the theories concerned are attempting 
to occupy the same linguistic role, or level. But what does this mean, 
in general? 

These are real obligations for a discourse pluralist. I think they 
should not be unwelcome, however. On the contrary, they provide a 
further opportunity for pluralism to display its comparative credit- 
worthiness, by showing that it has the means to meet such concep- 
tual debts. In Facts and the Function of Truth, I tried to provide a 
suitable account of truth for these purposes—an account that has 
the potential to explain why diverse uses of language should be alike 
in being treated as “‘truth-bearing.”” The basic idea is that despite 
the diversity of uses of language, different uses tend to be alike in 
this respect: within each use or discourse, there is some potential 
utility in noting and resolving disagreements between the members 
of a speech community. A notion of truth encourages a community 
to realize this potential. It does so by generating a social value that is 
negative when speakers disagree and positive when they agree. Dis- 
agreement thus becomes socially unstable, and argument, with its 
long-run benefits, is thereby encouraged. 

This approach does not require that argument is always benefi- 
cial, of course. Like any evolutionary theory, it keeps its eyes on the 
long run. Nor, importantly for a pluralist, does it require that the 
kind of benefit always be the same. The pluralist can thus allow, on 
the one hand, that there is no uniform way in which each of the 
many language games affects our well-being; and yet, on the other, 
that in all or most such games there is some benefit in argument, and 
hence in the availability of a notion of truth. In fact, I think the 
pluralist can do considerably more than this. For it turns out that 
our use of the notion of truth is not strictly uniform across the range 
of different domains of discourse. There are significant differences, 
underlying the predominant pattern. I think that pluralism can do 
much to account for these variations, by appealing to the different 
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functions or linguistic roles of the different domains concerned. . 
And only pluralism has the flexibility to do this: other approaches 
are tied too rigidly to the existence of a fact-stating/non-fact-stating 
dichotomy in language. 

What are these discourse to discourse variations? They turn on 
the differential tendency of arguments to be resolved in such a way 
that neither participant is judged to have been initially mistaken— 
for example, the kind of thing that may happen when a disagree- 
ment about a probability turns out to have stemmed from the fact 
that the speakers concerned had access to different evidence. This 
kind of outcome to an argument manifests itself as a sudden reluc- 
tance to apply the notions of truth and falsity to the judgments 
concerned. Such no-fault disagreements occur for different reasons 
in different parts of language. To a surprising extent, however, it 
seems to be possible to explain where they do occur in terms of an 
understanding of the distinctive functional role of the type of dis- 
course concerned, coupled with the general principle that the ori- 
gins of truth and falsity lie in their role in encouraging useful argu- 
ment. (In the process, incidentally, it is possible to explain many of 
the intuitions that have motivated nonfactualism—but to do so 
without the nonfactualist’s problematic division of language into 
fact-stating and non-fact-stating categories.) . 

This approach has affinities with recent work of Crispin 
Wright’s® on what might be called the “fine structure” of truth. 
Wright has distinguished a number of different components that 
may, but need not all, be characteristic of the use of truth in associa- 
tion with a particular area of discourse. In effect, he suggests that 
we may classify a subject matter according to which of the set of 
these characteristics we take its notion of truth to involve. As I 
understand it, his concern is mainly descriptive and taxonomic. I 
think there is some prospect, however, that the structure thus dis- 
cerned will turn out to be explicable as sketched above, in terms of 
some general account of the function of truth in language. Whether 
the best general account will be in the terms I outlined earlier re- 
mains to be seen. But if it is to be in keeping with Wright’s program, 
I think it will have to be like my account in being explanatory rather 
than analytic: its focus will be not on the question ‘What ts truth?” 
but on the question ““Why do ordinary speakers have such a notion 
as truth?” 

Interestingly, Wright too takes this stance on truth to support a 
noncomunittal metaphysics, at least as the natural fallback position. 

™ See ‘‘Realism—the Contemporary Debate: Whither Now?” forthcoming in 
Reality, Representation and Projection, J. Haldane and C. Wright, eds. (New 
York: Orford). 
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He characterizes this position as an antirealist one, in Michael 
Dummett’s sense, on the grounds that the minimal notion of truth it 
requires can be thought of as derived from assertibility. I disagree at 
this point, arguing that in practice the minimal notion of truth is 
already realist in this sense, for reasons to do with the origins of 
negation.” That issue aside, I take the following remark of Wright’s 
to be in much the same vein as my emphasis on the economical 
advantages of discourse pluralism: ‘‘Anti-realism thus becomes the 
natural, initial position in any debate. It is the position from which 
we have to be shown that we ought to move. All the onus, every- 
where, is on the realist” (op. cit., p. 11, typescript). Substitute ‘plur- 
alist’ for ‘antirealist’ and ‘nonpluralist’ for ‘realist’, and these are my 
sentiments exactly. 

The project also has affinities with Blackburn’s quasi realism. The 
quasi realist about moral discourse (for example) wants to argue 
that, although moral judgments are not really factual, they are enti- 
tled to the trappings of factuality, including a respectable notion of 
truth. So, although there is only one domain of genuine facts, it is . 
quite proper on this view for ordinary usage (and truth in particular) 
to work as if there were many. As we noted earlier, what separates 
this from discourse pluralism is the quasi realist’s assumption that 
his project is a limited one, bounded in its application by the avail- 
ability of a substantial distinction between factual and nonfactual 
uses of language. If that distinction lapses, quasi realism is no longer 
a distinct alternative to discourse pluralism. Discourse pluralism is 
again the default position. But so long as a more limited quasi real- 
ism remains a live project, the quasi realist’s interest in explaining 
why certain discourses should usefully employ an “artificial” notion 
of truth will apparently coincide with the project described above. 

In sum, there are considerable grounds for doubt as to whether a 
useful principle of semantic unity is available to an additive monist, 
other than as an ad hoc and costly theoretical primitive. And there 
are considerable grounds for optimism concerning the project to 
explain what superficial unity we find in language (and the limits of 
this unity) on foundations that only the discourse pluralist finds 
congenial. The case is not closed, of course, but it is the additive 


monist who is on the defensive. 
V. PLURALISM AS DEFENSIBLE REALISM 


At this point it may be helpful to emphasize that discourse pluralism 
is not an irrealist position. The pluralist accepts with all sincerity 
that there are moral states of affairs, possible worlds, numbers, or 





2 See my “Sense, Assertion, Dummett and Denial,” Mend, xcu (1983): 174- 
88; and “Why ‘Not’?” Mind, xcrx (1990): 221-38. 
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whatever. What she rejects is the additive monist’s attempt to put a 
further metaphysical gloss on such existential claims. (That gloss 
turns out to depend on a semantic distinction of questionable stand- 
ing, or so the pluralist argues.) Without the gloss, discourse 
pluralism sits quite happily with a nonmetaphysical or “minimal” 
realism.** 

Perhaps more surprisingly, this species of realism turns out to be 
particularly well-protected against one of realism’s traditional oppo- 
nents. Discourse pluralism has a defense against an instrumentalist 
challenge, of a kind that continues to be pressed against realists in a 
number of branches of contemporary philosophy. This defense is 
simply not available to the monist, for, as we shall see, the antidote 
the pluralist is able to employ against instrumentalism works equally 
well against additive monism. 

Instrumentalists prey on the Humean tendencies of the modern 
philosophical public, offering a simple path to ontological ascetic- 
ism. To avoid the realist’s ontological excesses, they tell us, we need 
simply treat our talk of such ontology as a convenient instrument. 
The talk works just as well when construed this way, and yet in 
ontological terms it is absolutely free. Thus, vast ontological savings 
compared to realism, and at no extra cost elsewhere. 

Of course, realists often respond that the offer is too good to be 
true. There is a cost elsewhere, albeit one that may not be obvious at 
first sight. What we lose under instrumentalism, the realist claims, is 
the ability to explain why the instrument works. The need for such 
an explanation thus becomes a debt that the instrumentalist cannot 
pay off. 

This response is a good deal less effective in some cases than in 
others, however. Whatever its merits as an objection to instrumenta- 
lism about theoretical physics, it is likely to be of little use in defense 
of realism about possible worlds, for example. Here the realist will 
typically acknowledge that a proper explanation of why talk of non- 
actual worlds is useful can only appeal to the actual world—i.e., in 
effect, acknowledge that such talk would be just as useful if there 
were no other worlds. Similar considerations might apply in the case 
of intentional psychology. Realists might well grant that an adequate 
explanation of why talk of beliefs and desires is useful could in 
principle be cast in terms of scientific theories making no reference 
to intentional states. 

In cases such as this the realist needs a different strategy, and 


Se en eee E ee a soe Tobe Capt Da 
course pluralism seems compauble with what Campbe the “simple view” 
about entities or properties of a given kind. See, e.g., his ‘ ‘A Simple View of 


Colour,” in Reality, Representation and Projection. 
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might be tempted to appeal to Quine. For has Quine not taught us 


that there is no more to the question whether there are possible 
worlds (say) than the matter as to whether such things are quantified 
over in the most serviceable philosophical theory of modality and 
related topics? And given that the instrumentalist does not deny us 
our talk of possible worlds (quantification included), do we not 
therefore have Quine’s authority for rejecting the instrumentalist’s 
claim that there are no such things? Does not the talk itself provide 
all the warrant we ever have for an existential claim? 

It is important, however, to appreciate that there are two ways to 
interpret this argument, resting on two quite different ways of inter- 
preting Quine’s views on ontology. On one reading, the Quinean 
doctrine is effectively a principle of ontological quietism—the prin- 
ciple that there is no separate second-order science of ontology, but 
simply the mundane business of existential quantification carried 
out by first-order specialists in the course of their working lives. I 
shall come back to this: I think it does provide an objection to 
instrumentalism, but not the objection the metaphysical realist was 
hoping for. There seems to be a tendency to read the Quinean 
doctrine in quite a different way, however. Under this second read- 
ing, the activities of first-order specialists provide raw data to which 
the Quinean principle is then applied, yielding second-order onto- 
logical conclusions. In other words, the fact that first-order special- 
ists quantify over the entities of a certain kind is held to constitute 
evidence that such entities actually exist. Far from dismissing the 
science of ontology, Quine’s doctrine thus becomes the main instru- 
ment in the working ontologist’s tool kit—a kind of “magic eye” for 
detecting otherwise invisible existents. 

Had Quine provided us with such a magic eye it would indeed 
provide a powerful weapon against instrumentalists (or at least those 
instrumentalists not prepared to take issue with Quine). “See for 
yourself,” the realist would say, inviting the instrumentalist to peer 
at reality through Quine’s lens. But I think that to make this explicit 
is to make it obvious that Quine intends no such thing. The right 
reading of Quine is the quietist one.” Now this reading too counts 

ris asta ae abn af Suis sens fone Ta parallel d manaoeaarn 
concerning the role of best explanaton of science. It is part of the practice o 
science to accept the best current explanations of observed mena. This is 
hardly more than a truism: to say that an explanation is the we have is ipso 
facto to mdicate that we give it more credence than anything else on offer. There 
is a tendency to make a second-order principle of inference out of this: to say that 
because they provide best ions, the theories concerned are to be 
true, and therefore worthy of a further degree of credence. Another similar mis- 
take is that of thinking that the principle that the best choice is the one thar 
Maximizes one’s expected utility assigns a further value to the choice in question, 
over and above its expected utility. i 
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against the instrumentalist. The instrumentalist requires that not all 
good theories involve ontological commitment (via their existential 
quantification); for it depends on whether the theory in question is 
genuinely descriptive or merely an instrument. As a quietist princi- 
ple Quine’s doctrine contends that there is no such further issue. 

But does this favor realism? Yes, but only a realism of the dis- 
course pluralist’s minimal kind. For what the quietist principle de- 
nies us is something that is needed by additive monists, as well as by 
instrumentalists. Instrumentalism is a form of nonfactualism. Like 
additive monism, though for a different reason, it thus depends on a 
substantial distinction between factual and nonfactual uses of lan- 
guage. The monist needs an account of what keeps the domains of 
discourse together; the nonfactualist an account of what sets them 
apart. The instrumentalist thus owes us an account of what is lacked 
by those domains of discourse which are judged “merely” instru- 
mental, and hence of what is not lacked by those which pass the test. 
Without such an account, for one thing, there seems nothing to stop 
the above kind of argument for instrumentalism from going all the 
way down. All discourse would thus be construed as fictional dis- 
course, and the contrast that gives point to instrumentalism would 
be lost. Modal and moral discourse would be on the same footing as 
discourse of any other kind, just as the pluralist and the additive 
monist both contend. 

So the instrumentalist needs a substantial factual/nonfactual dis- 
tinction. Often this is supposed to be cashed in ontological terms. 
The instrumentalist is presented as denying the existence of a class 
of entities of a certain kind—possible worlds, or mathematical ob- 
jects, for example. We have just encountered a reason, however, for 
doubting that this can serve as the fundamental distinction. Con- 
strued in a Quinian sense, ontological commitment is a more or less 
trivial consequence of serious theoretical commitment; even if there 
is always room for argument as to which ontology strikes the best 
ideological bargain. Hence instrumentalism cannot be primarily an 
ontological doctrine. The instrumentalist needs a prior distinction 
between full-blooded and merely fictional theoretical commitment. 
An ontological distinction would flow from such a distinction, but 
cannot provide it; not, at least, unless we are prepared to discard the 
Quinean principle that our only guide to ontology is the quantifica- 
tional structure of accepted theory. 

All of this applies equally to the additive monist. Quine’s ontologi- 
cal quietism prevents the monist from casting the unifying principle 
he requires in ontological terms.” There is nothing to prevent a 

* | argued in section III that the monist’s unifying principle cannot be entirely 
ontological in any case, but must involve a semantic component. The present 
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discourse pluralist from embracing the Quinean doctrine, however, 
the pluralist, too, will be interested in the most advantageous theo- 
retical formulation of any given level of discourse, and may well see 
the choice in Quinean terms. The pluralist thus has an objection to 
instrumentalism which would be suicidal in the hands of an additive 
monist. 

Discourse pluralism thus provides a particularly defensible form 
of realism—a position uniquely placed to exploit some of the hid- 
den costs of irrealism. A corollary is that those who find themselves 
in the grip of realist intuitions about matters that are otherwise 
vulnerable to instrumentalist attack would do well to take pluralism 
seriously, for it may well be the only realism they can have. 

VI. CONCLUSION 

We began with two ways to respond to the moral inconsistency of a 
prosperous Franciscan. One response was simply to point out that 
he could well make do with less. I compared this to challenging the 
lapsed Humeans’ case for particular metaphysical luxuries. This is 
what nonfactualists do, in effect, in arguing that in order to make 
sense of say modal discourse, we do not need to assume a realm of 
modal facts; instead, we may read the discourse nondescriptively, so 
that its interpretation does not require such facts. My main goal in 
this paper has been to offer an alternative deflationary challenge to 
non-Humean metaphysics. This compares to granting that the Fran- 
ciscan has the same range of human needs as anyone else, and the 
same right to reasonable satisfaction of those needs; but insisting 
that in granting this, we overturn the idea that there is any special 
virtue in self-imposed poverty. 

Discourse pluralism thus allows the lapsed Humeans their talk of 
possible worlds, modal facts or whatever, subject only to the modi- 
fied Quinean requirement that such an ontology provides the most 
economical basis for the particular discourse in question (i.e., 
roughly, the basis that best enables that discourse to serve its in- 
tended function in human life). What it withholds is the concession 
that such talk maps the bare objective structure of a single indepen- 
dent reality. On the contrary, we pluralists maintain, the idea that 
there is such a mapping rests on a theoretical error about language, 
namely, the view that there is a single. substantial category of de- 
scriptive or fact-stating discourse.”’ It follows that the Humean con- 
ception of metaphysical virtue was always misguided. In granting the 
lapsed Humeans their metaphysical comforts, we thus deny them 


point is that Quinean quietsm undercuts the ontological strategy at first base, as 
it were. 

T We pluralists think that many of those who try to deflate the lapsed Humean 
in the first way also fall prey to this misconception 
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their Humean metaphysical virtue; for we say that there has never 
been any such thing. 

Pluralism thus takes the heat out of contemporary metaphysical 
debates about a very wide range of topics: modality, mathematics, 
moral discourse, conditionals, and many others. As a final example, 
a well-motivated pluralism might allow one to endorse something 
like Daniel Dennett’s™ account of the role and origins of intentional 
psychology, without having to concern oneself as to whether the 
view amounts to instrumentalism about the mental. 

Some readers will see this supposed advantage as like that of theft 
over honest toil—or perhaps more accurately, of idleness over hon- 
est toil. I see it as rather the advantage of idleness over unnecessary 
and unproductive toil—the advantage of stopping work on a mis- 
conceived and unproductive philosophical seam. True, we cannot be 
certain that the seam will remain unproductive, but there is enough 
doubt about the matter for the wise course to be to consider some 
of the general issues on which its eventual productivity might de- 
pend. I have emphasized that these are naturalistic issues, in particu- 
lar issues about language as a natural phenomenon. (Does it admit a 
significant descriptive/nondescriptive distinction, for example?) So 
they are hardly issues that the pluralist’s likely opponents may justly 
ignore. 

At the very least, we could then go back to work with a better idea 
of what we were looking for in the first place. Instead, however, we 
might want to explore other philosophical seams, other projects that 
would now seem more promising: perhaps the elaboration of the 
general descriptive/explanatory approach to truth; perhaps the 
broadly Kantian examination and classification of different domains 
of discourse, with reference to their role in human life;? perhaps 
the exploration of nice issues concerning the connections and dis- 
connections between the various discourses. A different philosophy, 
certainly, but not an idle one. 

HUW PRICE 
University of Sydney 

3 See, e.g., "Real Patterns,” this JOURNAL, LXXXVII, 1 (January 1991): 27-51. 

™ Recent work attempting to generalize the notion of a secondary quality might 
be seen as falling under this heading. See for example Wright, “Realism-the Con- 
temporary Debate: Whither Now?”; and Mark Johnston, “ \jectivity Refigured,” 


ın , Representation and h ; Jobnston’s “ Theories of 
Value,” of the Aristote Society, sg oe H (1989): 139-74; 
and Pettit, “Realism and Response-Dependence,’ c (1991): 587-626. For 


a critical view of this program, offering an alternative more in keeping with dis- 
course pluralism, see my ‘Two Paths to Pragmatism,” in Response-Dependent Con- 
TA Peter Menzies, ed. (Canberra: ANU Research School of Social Sciences, 
1991), pp. 46-82 


410 THE JOURNAL OF PHILOSOPHY 
RIGIDITY, ONTOLOGY, AND SEMANTIC STRUCTURE* 


r | The notion of rigid designation stands squarely at the inter- 
face of metaphysics and philosophy of language.’ Itself a 
semantic notion, it is metaphysical in its applications, and in 

that whether an expression is rigid depends on whether its reference 
is in all worlds the same. Rigidity has played an important role in the 
development of a broad philosophical view that has largely sup- 
planted the dominant empiricism of the earlier part of this century. 
On this new view, important questions about what is necessary, what 
features of things are essential, and what the conditions of identity 
and individuation are for many things are factual, indeed empirical 
questions. They require not conceptual or semantic analysis, but are 
rather to be settled by looking at the world, investigating the objects 
in question, rather than the expressions used to refer to them. Our 
findings tell us about the “deep” structure of the world, not about 
our conceptual scheme. Rigidity plays a central role in all this, be- 
cause it is the fact that many key terms are rigid which allows us to 
fix upon an object so as to investigate it with respect to both the 
actual and other possible worlds. This new view, for which Kripke 
and Hilary Putnam’ are largely responsible, is partly characterized 
by the following claims (these may not all be distinct views): 


(1) By using expressions rigidly, we can ask counterfactual questions 
about objects without any a priori specifications concerning the 

. properties of the object of reference. 

(2) We can stipulate possible worlds without worrying about or set- 
tling questions concerning transworld identity conditions. 

(3) Our modal inquiries are directed at the objects? themselves, and 
in no way concern our concepts or the meanings of words. 

(4) Thus, we can make a posteriori de re modal discoveries—we can 
have empirical discovery of essence. These findings represent lan- 
guage and mind-independent modal features of objects. 

(5) We can form identity statements between rigid designators 


* Many people have helped me with the issues in this paper. Thanks to Thomas 
A. Blackson, David O. Brink, Berent Enc, Ted Everett, Martha Gibson, Paula 
Gottlieb, Elliott Sober, Dennis Stampe, and Leora Weitzman 

1 Saul Kri mtroduces the notion in Naming and Necessity (Cambridge: 
Harvard, 1 ), pp 48-9, and provides a fine overview m his , pp. 3-15. 

* Putnam's views are presented in many of the essays in Mind, Language and 
Reality (New York: Cambridge, 1975); ‘The Meaning of ‘Meaning’ ” is probably 
the most important. 

3? To ease exposition, I shall use ‘object’ to range over any object of reference 
_ or inquiry, it includes, e.g., kinds as well as individuals. 
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G) if the expressions happen to tag the same object, the state- 
ment is true, and necessarily so; 

(ii) if we can only find out the truth of this statement empirically, 
we again have a posteriori necessary truths, 

(iii) metaphysically and semantically, there is nothing more to the 
truth of an empirical identity statement than “same thing 
tagged.” 

(1)}H{3) contrast with the more traditional view according to which 
reference is determined through meaning. On that view, one cannot 
sensibly talk about an object in another possible world except as it 
satisfies, in that world, the conditions given by the meaning of some 
word. But then it is specified a priori that the referent of the word, 
whatever it is, has those properties, and to specify a possible situa- 
tion is to specify one in which an object with those properties is in 
some situation, thus settling the issue of transworld identity. And 
since our possible world descriptions are constrained by the mean- 
ings of our words, we have no independent grasp of the object to 
investigate 1f—all we are really investigating are conceptual connec- 
tions within our framework. (1)(3) represent the more ‘‘world-” or 
“object-directed” understanding of these sorts of philosophical in- 
quiry. (4) and (5) are two more overtly metaphysical features of the 
new view, exemplified by the purportedly necessary a posteriori 
‘Water is H,O’, ‘Margaret Truman has Bess Truman for a biological 
parent’, and ‘Hesperus is Phosphorus’. The “‘objéct-directed” un- 
derstanding of (1){3) feeds the metaphysical, rather than ‘‘concep- 
tual,” understanding of these phenomena, as represented in (4) and 
(5). I shall not discuss or criticize these positions individually; they 
are introduced here simply to point the reader toward a position 
that I hope is familiar, and understood to have a great deal of philo- 
sophical currency. 

In this paper, I shall investigate how rigidity plays such an impor- 
tant role in contemporary metaphysics. I shall argue that the role 
depends on a certain understanding of rigidity, which I call the 
metaphysical understanding (sections I, IJ): this is the view that the 
semantics of rigid expressions is determined by mind-independent 
facts of transworld identity. I shall also offer a more semantical 
understanding of rigidity, with which this may be contrasted (section 
VI). The metaphysical understanding, I argue, requires a privileged 
ontology according to which some, but not all, ways of carving up 
the world get at “‘real” objects (sections III, IV). The semantic inter- 


* Of course, this is not the content of an identity statement on this view. It is 
just all that there is to such a sentence’s being true 


412 THE JOURNAL OF PHILOSOPHY 


pretation, by contrast, carries no such metaphysical burden. While I 
shall not argue against privileged ontologies, I think it important to 
see this commitment. Many who accept the new view may not realize 
that they are so committed, and may not be inclined to accept such 
an ontology. Further, anyone who wishes to buy into the currently 
prevalent picture which (1)}-(5) partially characterize must defend 
this ontology. It is not at all supported, as one might think, by our 
intuitive judgments about which terms are rigid (section V). Indeed, 
I hope to show that it is by tacitly assuming such an ontology in 
familiar presentations of rigidity-supporting intuitions that we are 
so easily led to this larger metaphysical picture: the ontology, not 
our judgments of rigidity, does the work (section V). I leave it to 
defenders of the new view to try to provide the necessary argu- 
ments; I also hope my semantic account of rigidity will be of interest 


whatever one’s ontological views. 
I. RIGIDITY DE JURE AND DE FACTO; 
REFERENCE THROUGH SENSE AND IDENTITY 


An expression is rigid if-it has the same reference with respect to all 
possible worlds (in which it refers). Kripke distinguishes two ways an 
expression may be rigid, de jure and de facto (op. cit., p. 21 fn.). 
These correspond to two semantic interpretations for expressions, 
most easily illustrated with definite descriptions. 

The statement ‘The President of the U.S. is necessarily male’ has 
two familiar readings: on one reading, it ascribes an essential prop- 
erty to George Bush; on the other, it states the impossibility of a 
nonmale attaining to a certain political office. Semantically, “The 
President of the U.S.’ functions differently in the two cases.” In the 
first, the description picks out some individual in virtue of its de- 
scriptive content, and is understood to refer, in other possible 
worlds, to that individual (Bush), and its counterfactual properties 
determine the truth value of the statement. The descriptive content 
of the expression falls away; it is relevant only in determining an 
actual referent. So used, we may say that such descriptions gain 
their referents in other worlds through identity, i.e., through the 
identity of objects in other worlds with the actual referent. 

On the other reading, the description also picks out an individual 
by its descriptive content, but it picks out individuals in other worlds 
by the very same content. In each world, the individual that satisfies 
this descriptive condition (having a certain office) is its referent. 


5 While one can represent this by a scope distinction, there remains the ques- 
tion of what in the utterance of a modal sentence with descriptions makes it 
properly interpreted as having wide or narrow scope. 
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Descriptions so used may be said to gain their referents in other 
worlds through sense, i.e., through the meaning of the description. 

A description that refers counterfactually through identity will 
obviously be rigid, since the very condition for the expression’s ap- 
plying to something in another world is that that thing be identical 
to the actual referent. The rigidity of such expressions is rigidity de 
jure; the function of such an expression with respect to other 
worlds is precisely to pick out that very object. Descriptions that 
refer counterfactually through sense, on the other hand, may, and 
typically will, turn out to be nonrigid, for the objects that satisfy the 
conditions in various worlds may not all be the same, as with ‘The 
President of the U.S.’ In some worlds, it is George Bush, in others, 
Michael Dukakis or Elizabeth Dole. Now, it can turn out that the 
referents are all the same, as with ‘The sum of two plus two’. Such 
expressions will be rigid, but, so to speak, by accident.® Their rigidity 
is rigidity de facto. The semantic function of the term, as regards 
other worlds, is given by a descriptive condition—t is not stipulated 
that the referent in other worlds is to be the actual referent. It just 
turns out that way. 

These distinctions are not confined to descriptions. For any refer- 
ring expression useable in counterfactual discourse, we can ask how 
it functions semantically (through identity or sense), and can classify 
its rigidity, if any there be. Of course, if an expression has no de- 
scriptive content, no semantic structure, all that can determine its 
counterfactual reference will be the actual referent itself; such 
“contentless” expressions will consequently refer through identity 
and be rigid de jure. According to direct or causal theories of refer- 
ence, names are like this, and they provide the paradigm of rigidity 
de jure. 

Host philosophers have not taken much interest in rigid de facto 
expressions. The reason, I think, is that they cannot play the meta- 
physical role exemplified in claims (1)5). Since a rigid de facto 
expression has its counterfactual reference determined through 
sense, the answers to important modal questions will be analytic and 
a priori. When ‘The President of the U.S.’ is used this way, it is given 
a priori that the referent in any world will be the president there. 
This also constrains counterfactual stipulations we can make using 


* One can intend such a description to refer counterfactually through identity, 
making its rigidity not “accidental,” but de jure. With such ‘identifying’ descrip- 
tions, we may to notice that they can be ngid in both ways It is easier to see 
for ‘the element with atomic number 79’ (assuming that through sense, this 
rigidly picks out gold). 
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such expressions—we cannot stipulate worlds in which the referent 
fails to have these sense-specified properties. And this constraint is 
given a priori (unlike our inability to stipulate worlds that violate 
necessary a posteriori truths). So here, our modal investigations are 
not so ‘‘object-directed” as when reference is through identity. 
When reference works through sense, even when rigid, our means 
of getting an object of modal inquiry already build in, in making it 
the object of inquiry, what counterfactual properties it must have.’ 
The lack of interest in rigidity de facto is the (implicit) recognition 
that, for the metaphysics, rigid reference must occur through 
identity. 

And for metaphysics, then, the relevant distinction is not that 
between rigid and nonrigid expressions per se, but that between 
expressions which refer counterfactually through identity, and those 
which refer through sense.” Names have pride of place because (on 
the current view) they always refer through identity, while descrip- 
tions only sometimes do. Note, though, that names need not be 
wholly without content to be rigid de jure. All that matters is that 
this content plays no role in determining their reference in other 
worlds. If names need to, or sometimes do get actual referents by 
reference-fixing descriptions,’ they will function just like descrip- 
tions used rigidly de jure: both will have their actual referents de- 
termined through sense, but their counterfactual referents deter- 
mined through identity.’° 


7A “sense” based view can allow some modal questions to be left open—for 
counterfactual conditions may be specified generally or abstractly (eg, ‘same 
origin’ or ‘same deep structure’), leaving empirical investigation to find out what 
in particular (actually) ‘‘fills m” these conditions. This means that a (modified) 
traditional view can allow necessary a posteriori truths, and “stipulation” of other 
worlds need not (implicitly) answer all modal questions. But what ıs left open 
would have modal import only because of the semantics, undermining the sup- 

metaphysical import of (1}{5) mostly given by (3), the second sentence of 
4), and (5) (ili). For examples of such “general” specification, and more on a 
“conventionalist” account of a posteriori necesnty, see my Necessity, Essence 
and Individuation (Ithaca: Cornell, 1989), esp. chs. 2, 3. 

8 Or, alternatively, between expressions that are rigid de jure and all others. As 
the mechanism is what matters, I prefer my first formulation. The umportance of 
this distinction is also noted by A. D. Smith, “Rigidity and Scope,” Mind, XCI, 
369 (April 1984) 177-93. 

°? See Knpke, Naming and Necessity, pp. 55-6 for his troduction of the 
reference-fixing/meaning-giving distinction. 

10 Thus, a causal account of reference is not, as such, vital to the “new” meta’ 
physics. Further, while Kripke has answers to description theorists who claim that 
names are descriptions which happen to always take wide scope in modal contexts, 
their ponton would anyway not touch the metaphysical side of his work, as the 
senses of such descriptions fall out as irrelevant to determining counterfactual 
reference. 
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Il. THE METAPHYSICAL UNDERSTANDING OF RIGIDITY > 

The familiar picture of how rigid designation allows for empirical 
inquiry into necessity and essence actually makes plain that the im- 
portance of rigidity resides in rigidity de jure and reference through 
identity. The story is that we introduce a term to refer rigidly to 
some object. This term then gives us a handle on that entity so we 
can investigate its properties, not only actually, but also, because of 
rigidity, in other possible worlds. The subject of our investigation is 
then that object, and any positive results we find give us modal 
properties of it.!! 

This picture is, I hope, familiar enough. But as we have seen, if 
the object is truly to be the subject of our inquiry, it cannot just be 
rigidity at work here. What is key is not simply being able to refer to 
some object in other possible worlds, but being able to do so in a 
way that does not build in what features it will have in these worlds. 
We need the object itself, and not some description, to determine 
the expression’s counterfactual reference. Expressions that refer 
counterfactually through sense do not meet this condition, even 
when they are rigid (de facto). 

Philosophers who do accept claims (1}{(5) then, accept, or are 
committed to, a particular understanding of how rigidity works, at 
least in the relevant cases (those in which we get empirical discovery 
of necessity). We can characterize this commitment first by a nega- 
tive claim, the descriptionlessness thesis: 


DL: (Many)!? rigid designators have no descriptive content determin- 
ing (or constraining) their application in counterfactual situ- 
ations. ' 


Our earlier remarks make clear why we do not say simply ‘have no 
descriptive content’. Descriptive content that merely determines ac- 
tual reference, as in rigid de jure descriptions, is no bar to the sort 
of object-directed modal inquiry in which we are interested. 

But how, then, és counterfactual application for such terms deter- 
mined? It is through identity: the counterfactual referent just is the 
actual referent. I shall call this positive claim the metaphysical un- 
derstanding of rigid designation: 


1! Such investigations might have turned lg HE er o 
argued (in “How Not to Derive Essentialism of Reference,” this 
JOURNAL, LXXVI, 12 (December 1979): 703-25; ey Reference and Essence 
(Princeton: University Press, 1981)), the theory of reference, or the notion of 
rigidity does not siself give the metaphysical results. But ıt renders such investiga- 
tions sensible, whereas on traditional theories, we can only investigate objects as 
they satiify the conditions given in the schie: 


The qualifier 1s just to acknowledge rigid de facto descriptions. 
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MRD: The reference of (many) [see fn. 12] rigid designators in other 
worlds is given metaphysically—it is whatever object in the 
world in question is (identical to) the actual object of reference. 


I call this the “‘metaphysical” understanding because once one has 
determined an actual referent (and perhaps has intended that the 
expression be rigid), the world takes over—the facts about identity 
across worlds determine counterfactual reference. . 

Whether MRD is the same as the claim that many rigid designa- 
tors are rigid de jure, or have their reference determined through 
identity, depends on how we choose to use these latter notions.’® 
Whatever we decide about that, MRD is worth separate presenta- 
tion, since it states explicitly what is required for rigidity to do its 
metaphysical work. MRD asserts not only that some expressions 
have their rigidity semantically built in, but further, that this hap- 
pens metaphysically; it is simply the mind/language independent 
fact that x in w, is identical to y in the actual world that makes x the 
referent of T in w,. This takes a definite stand both on there being 
such independent facts, and on our ability to fix our terms’ refer- 
ents without this reference-fixing playing the crucial role in deter- 
mining counterfactual reference (in some way other than on tradi- 
tional theories); this is left open, at least on one reading, by the 
notions of rigidity de jure and (perhaps) reference through identity. 

I hope that many readers recognize MRD as the view that they do 
have about how rigid designation works (except for rigid de facto 
descriptions); I have mostly just tried to articulate it, and to explain 
why those who accept the metaphysical view that usually goes with 
presentations of rigidity are committed to it. 

My next task is to show that MRD—and so (1)(5}—requires a 
“privileged” ontology, and that the phenomenon of rigidity does 
nothing to support such an ontology or, consequently, MRD. 


13 Since it is so typically assumed that “‘nonaccidentally” rigid terms always lack 
descriptive conditions for counterfactual reference, the issue of whether rigidity 
de jure is defined by “built-in” rigidity, or also requires nondescripuve reference, 
does not arise. As we shall see (sec VI), however, S e a sacle 
despite having descriptively determined counterfactual reference. So we 
have to decide. If rigidity de jure 1s just built in rigidity, then it does not suffice for 
rene pave reference precludes igi de ur its the same aa MRD- tnong i 

ee ee it is the same as MRD—though it 
names, say, are rigid de pure. Similar remarks apply for 
no longer data” at n 

The important question is: Do names, and expressions that function in modal 

contexts names, have their counterfactual reference determined by mind- 
t facts about the identity of objects across worlds? This is the question 

of ether MRD is true. Whether it is also the question of whether there 1s in fact 

rigidity de jure, or reference through identity, is but a terminological matter 
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Il THE STRANGE AND UNUSUAL 

Let me describe a purported entity, which I shall call “The President 
of the U.S.,” or “Prez” for short. Prez is currently composed by 
George Bush, was composed in 1790 by George Washington, and in 
1863 by Abraham Lincoln. In some other worlds, Prez is currently 
composed by Mike Dukakis, and in others by Elizabeth Dole. A per- 
_ $on comes to constitute Prez, usually, by being sworn into office, 

though it may not work this way in all other worlds. Prez bears 
basically the same relation (at a time) to the people that are (or have 
been, or could be) Prez that a statue or river bears to the portions of 
matter (clay, water) that constitute them (at a time).'* Prez is not a 
mereological sum of the people who have been (are) the president 
(unless these other entities [statues, rivers] are, which they are not 
—at least of those things which actually constitute them, supposing 
mereological sums to have their parts essentially). When someone 
leaves office, Prez does not go out of existence, but rather, comes to 
be constituted by another person, as the Nile comes to be consti- 
tuted by different (portions of) water.” 

Of course, Prez differs from statues and rivers in that it isnot 
continuous through space and time—when there is a swearing in, 
Prez jumps in space to occupy the location of the person who has 
just been sworn in. Between such ceremonies, though, Prez follows 
a person-like path of spatiotemporal and psychological continuity. 
Fully precise identity conditions for Prez would be tricky, but I hope 
the basic idea is clear. Prez is constituted by, and not identical to, 
the people who hold the office of President of the U.S. And even in 
worlds where the presidency is abolished after Washington, Wash- 
ington would be Prez only in the sense of constitution, as they would 
still differ modally, and in their identity conditions. 

Prez is person-like in that it has mental states and performs inten- 
tional activities (‘The President of the U.S.” vetoes bills); however, 
as just noted, it does not have the identity conditions persons have 
(say, spatiotemporal or psychologicaP continuity). For a general cate- 
gory, we may call Prez an “office person”; other office people are 
“The Owner of the Yankees” and “The Pope.” Office people also 
belong to broader class of (purported) entities, which includes “My 
Dog,” which was composed by Fido, and now by Spot, and “Eliza- 
beth Taylor’s Husband,” which I leave to the reader to articulate. 


‘On constitution, see David Wiggins, Sameness and Substance (New York: 
Oxford, 1980); and “On Being m the Same Place at the Same Time,” Philosoph- 
cal Review, LXXVI, | (January 1968): 90-5. 

18 The Nile also comes to be constituted by different water, but here I am 
concerned with the relation between ‘individuals. 
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For reasons which should be plain, I shall call all of these ‘‘unusual” 
entities; for now, it should suffice to say that unusual entities are 
ones that we do not usually (explicitly, at least) recognize as (single) 
entities. 

Let us call an ontology that recognizes unusual entities a permis- 
sive ontology, and one that does not restrictive. Further, let us call 
an ontology that recognizes no objects—one according to which 
there is stuff in the world, but none of it endowed with the sort of 
identity conditions we think distinguish objects from mere stuff—a 
fully restrictive ontology. We may then call an ontology that is nei- 
ther fully restrictive, nor permissive, a privileged ontology. In a pri- 
vileged ontology, some but not all ways of carving up the world get 
at real objects; some but not all proposed identity conditions are the 
identity conditions for some class of objects." It is this sort of ontol- 
ogy to which defenders of MRD are committed.’” 

As I said, I shall not argue against a privileged ontology, but it is 
incumbent upon the defender of such a view to say what the favored 
entities have, and unusual ones lack, in a way that neither leads us to 
wholesale restrictivism, nor just makes us want to introduce a new 
general term to include both favored and unusual entities. As a 
middle position, it needs to withstand challenges from both sides, 
and it is not obvious that acceptable answers to one of the other 
positions can be made which will not push one to the opposite view. 
While there are many differences between people and office people, 
it is very questionable whether any of these has the right sort of 
metaphysical significance.'® 

IV. WHY MRD REQUIRES A PRIVILEGED ONTOLOGY 
(a) Restrictive ontology. It should be plain that one cannot advocate 
the metaphysical understanding of rigidity with a wholly restrictive 
ontology. Since on this view there is no (mind-independent) identity 
across possible worlds, the facts of identity just are not up to the 


16 Of course, everyone that conditions like ‘horse with a horn’ pick out 
no real objects. Whether a of carving’ gets at real objects rather concerns its 
tndividuative force given its actual application. Wholesale restrictivists allow 
that there are “portions of the world” which are composed of H,O, but deny 
that, mind-independently considered, this condition is individuative, or the es 
sence of anything 

17 In fact, their ontology must rule out not only the “unusual” entities under 
discussion, but quite a few not-so-unusual entities, since any widespread spatio- 
ia aa a lear elect Seas lems of indeterminacy to be dis- 
cussed. Our “unusual” entities problem more vivid. 

18 Eli Hirsch offers a tho challenge to privileged ontologies in “Rules for . 
a Good Language,” this JOURNAL, LXXXV, 12 (January 1988): 694-717. A rare 
explicit defense of a privileged view, Pea AN G O E eerie 
occurs in Sydney Shoemaker, “On What There Are,” Philosoph Topics, XVI, 
1 (Spring 1988): 201-24 
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task of determining the counterfactual reference of a term— 
indeed, even actual reference is problematic. If one tried to intro- 
duce a term as rigid de jure, to refer in other worlds to whatever is 
identical with the actual referent, the term would just fail to refer in 
other worlds. And obviously, the metaphysics (1}5) characterize 
could never get off the ground. Put simply, MRD presupposes 
mind-independent crossworld identity, and with a wholly restrictive 
ontology, this supposition is false. 

Despite this obvious incompatibility of a wholly restrictive ontol- 
ogy with the new metaphysics, it is worth noting. For if MRD also 
cannot abide a permissive ontology, then the defender of MRD can- 
not just defend a restrictive (as opposed to a permissive) ontology, 
but further, this defense cannot lead back to extreme restrictivism. 
The metaphysical understanding of rigidity requires a privileged, 
and not merely a restrictive, ontology. 

(b) Permissive ontology. Can rigidity be metaphysically determined 
in a permissively populated world? I shall first consider the rigidity 
of descriptions, and then apply our findings to the case of names. 

As standardly discussed, descriptions may be used either rigidly or 
nonrigidly. If “The President of the U.S.’ refers through sense, it will 
denote Bush nonrigidly; if it refers counterfactually through iden- 
tity, it rigidly designates Bush. But now suppose we allow Prez in our 
ontology. Referring through sense will no longer suffice for ‘The 
President of the U.S.’ to be nonrigid, for in virtue of its descriptive 
content, it can pick out Prez just as well as Bush, and if it picks out 
Prez, it does so in every possible world—for Prez is in every world 
and always the President of the United States. Indeed, one might 
think that Prez, if it exists, is the more natural referent of ‘The 
President of the U.S.’, so on a permissive ontology, every definite 
description, if used to refer through sense, threatens to be rigid de 
facto. For now, though, all that matters is that on a permissive ontol- 
ogy, descriptions used to refer through sense, without further speci- 
fication, are indeterminate between rigidly denoting unusual enti- 
ties, and nonrigidly denoting the particulars that constitute them at 
a given time.!® 

On the other side of the coin, using our description to refer 
counterfactually through :dent{ty leaves it open whether the value 
of ‘The President of the U.S.’ in other worlds is Bush or Prez. For 
again, without further specification, they have equal claim to be the 


® This is not to say such descriptions would always be indeterminate—only that 
when they are not, it is not just in virtue of intending to refer counterfactually 
through sense 
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actual referent of the description, but, of course, they determine 
different referents in other worlds. To designate Bush rigidly with 
this description, it will not suffice simply to intend to refer counter- 
factually through identity. 

In both cases, a permissive ontology threatens the determinacy ot 
reference. In the case of referring through sense, this appears as a 
puzzle about how we can manage to use descriptions nonrigidly; in 
the case of referring through identity, the problem is determining 
which of (at least) two things is rigidly designated. Whereas on a 
wholly restrictive ontology, there are too few mind-independent 
crossworld identities (i.e., none) for MRD, on a permissive ontology, 
there are too many. MRD presupposes not only that there are some 
identities, but, since it asserts that counterfactual reference is deter- 
mined simply by the fact that some object in another world is identi- 
cal to the actual referent, that it is possible to refer determinately to 
some actual object in a way that allows this. 

Now, it ts possible to get determinate reference even within a 
permissive ontology. Since the objects the reference hovers between 
differ in their identity conditions, we can determine reference to 
one of them by intending to refer to the object with such-and-such 
identity conditions. Unfortunately for MRD, this is not a way to 
secure reference which just lets the ‘‘facts of identity” determine 
counterfactual reference—for here, this reference is determined 
semantically, not metaphysically: the counterfactual referent is 
whatever satisfies the intended conditions. By associating identity 
conditions with uses of expressions, we can also make out, despite 
the above problems, how there can be nonrigid uses of descriptions, 
even with a permissive ontology. This will be explained in section VI. 

It should be clear that these same problems apply when we turn to 
names. On the standard view, we get a referent for a name—via 
ostension or description—and then let this referent guide counter- 
factual reference. Clearly, when a name is introduced by a descrip- 
tion, our position is just as found above for the rigid use of descrip- 
tions—simply intending the facts of identity to determine counter- 
factual reference underdetermines such reference. But the same 
holds true for more purely ostensive introductions of names. Within 
a permissive ontology, wherever we point, there are too many over- 
lapping entities to which we may be referring, entities which differ 
in their identity conditions and modal properties. Until we specify 
what is to count as the same thing, we have no determinate refer- 
ence to any one thing at all. Of course, we do achieve determinate 
reference with, say, ‘Bush’. But this leaves completely open Aow this 
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occurs. The argument is not that ‘Bush’ cannot be determinate and 
rigid, but only that for it to be so, it must have more semantic 
content than just its referent. So again, there is no way to let the 
thing—the referent—determine the counterfactual reference of 
the name rather than, as in the case of descriptions, the criteria of 
sameness (identity). For a name to be rigid, it must be associated 
with a criterion of identity across worlds, and this then serves as the 
criterion of application for the name. While there can be rigidity, 

To sum up, MRD involves two presuppositions: first, that there 
are mind-independent facts of crossworid identity; and, second, that 
we can get determinate reference in the actual world in a way that 
allows these facts of identity to determine reference in other worlds. 
On a wholly restrictive ontology, the first (and maybe the second) is 
false; and on a permissive ontology, the second is false. Both suppo- 
sitions can be satisfied only within a privileged ontology; conse- 
quently, MRD, and the philosophical picture that requires it, re- 
quire such an ontology. Some who accept these views may be happy 
with this consequence—but the burden is on these “new” metaphy- 
sicians to defend this ontology. They cannot just say, “Well, since it 
is entailed by our position it is as well supported as the rest of our 
view.” For the principle evidence for the view starts from our intu- 
itions about rigidity, and as we have seen, it is not these data them- 
selves, but their metaphysical interpretation that is the cornerstone 
of the new view. Since this interpretation presupposes a privileged 
ontology, one is not entitled to it, or what one thinks flows from it, 
without first defending the ontology. But one might think this de- 
fense need not be wholly independent. Perhaps our judgments of 
rigidity themselves support a privileged ontology. I shall argue in 
the next section that this is not so. 


V DO OUR DIFFERENTIAL JUDGMENTS OF RIGIDITY SUPPORT 
A PRIVILEGED ONTOLOGY? 


To get people to see the rigid/nonrigid distinction, we ask similar 
questions about different terms. If we look at counterfactuals using 
the name ‘Nixon’, it seems that all are used to talk about ‘‘this 
man’’—the same one we speak of in actual situations. On the other 
hand, if we look at ‘The President of the United States’, it seems 
that, in other worlds, it may pick out someone else. Asked, of all the 
counterfactual referents of this expression, if they are the same, we 
naturally say “no.” In saying no, we assert a nonidentity, and it is a 
short step from here to supposing that there is no such entity as 
“The President of the United States.” At least, if we accepted a 
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permissive ontology, would we not be likely to say that “The Presi- 
dent of the U.S.’ was rigid, that its referents were identical? And if 
we were restrictivists, would we not deny that ‘Nixon’ was rigid, that 
its referents are identical? 

Thus, our common judgments of rigidity and nonrigidity might be 
thought to lend support to a privileged ontology, or at least show 
that most people accept this ontology. And one might think this at 
least puts the burden of proof onto the view’s opponents. 

This argument from our differential judgments about rigidity to a 
privileged ontology is problematic. I do not think our judgments 
support the claim that we do accept such an ontology, must less the 
claim that we should. For one thing, if one were to ask people who 
make the familiar judgments about nonrigidity whether or not they 
thought, say, that “The President of the U.S.” existed, I think many 
of them would at least admit that they had no special reason for 
denying this. “Sure, why not?” would be a common response, I am 
guessing. They need not be thinking of such entities when the ques- 
tions of rigidity are raised; one may need to ask explicitly. This might 
lead them to reconsider their earlier judgments that “The President 
of the U.S.’ is not rigid, in which case the “phenomena” might 
vanish. But they need not. One may think that while “The President 
of the U.S.” populates the universe, it is not the referent of many 
(or any) uses of ‘The President of the U.S.’ That ‘The President of 
the U.S.’ does not always have the same referent—or that we do not 
. think it does—is immediately just a fact about what we are referring 
to—not what there is (or is not). In section VI, I provide an account 
of how this description could be used nonrigidly (and rigidly) to 
denote particular people, even if there were “The President.” So 
judgments of nonrigidity, of themselves, can hardly show that we are 
committed to a privileged ontology. Further, and perhaps more im- 
portantly, I think we can give a plausible reason why people would 
make the nonrigidity judgments they do even if, on reflection, they 
were the firmest of permissive ontologists (or, suitably modified, the 
firmest of extreme restrictivists). 

The questions we ask when attempting to decide whether a term is 
rigid or not are not in any obvious way questions about what entities 
exist. Generally, they are identity questions, e.g., ‘Is the same entity 
present in all relevant situations’? Or ‘Is the referent of the term in 
all worlds the same’? So it should be no surprise, first off, if these 
inquiries do not pique people’s ontological sensibilities. This point is 
highlighted by other philosophically familiar uses of variables like 
‘thing’, ‘object’, and ‘entity’. Asked flat out, it is familiar that ques- 
tions and statements involving such variables are deeply ambiguous. 
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‘How many things are there on the table’? asked of a table contain- 
ing a deck of cards, may legitimately receive a number of answers, 
most obviously ‘one’—a deck of cards or ‘fifty-two’—cards; or who- 
knows-how-many combined cards, numbers, spots, etc. These an- 
swers require no special permissiveness in one’s ontology—we all 
recognize cards and decks as among the world’s occupants. But still, 
to get a definite answer—or ask a definite question—we need some- 
thing more specific than ‘thing’. Interestingly, we use such general 
terms all the time, successfully and without problem. Particles of 
dust and cookie crumbs, for instance, are understood not to be 
counted when one asks ‘How many things are there on the table’? 
This question ordinarily means, roughly, ‘How many medium-sized 
dry goods are on the table’? My not counting cookie crumbs in no 
way shows that I doubt their existence or status as “‘things.” When 
we use these most general terms, we have more or less agreed upon 
“default settings” that specify more particularly the type of thing in 
which we are interested. We may have different defaults for differ- 
ent contexts and, certainly, one can specify directly at any time. The 
important point is that we need not verbally explicitly do so. And 
because these pragmatic factors can be tacit, our ordinary intuitions 
about these things hardly show much about our ontology. 

Given that we have such conventions, or conversational customs, 
concerning these most general terms, it is not hard to see how they 
might enter into our investigations concerning rigidity. Indeed, we 
need not bring in “The President of the U.S.,” and the like, to see 
that they do. Consider our question of whether the same entity is 
present in any situation relevant to ‘George Bush is necessarily 
male’. Nobody even thinks to say “Well, there is one collection of 
molecules the instantiation of maleness of which is actually rele- 
vant, but in other worlds, there are different such collections.” 
There is no denial implied here of the ontological ‘‘thingness” of 
these collections—they are entities, all right—it is just supposed 
that we are talking about medium-sized dry goods like the man, 
George Bush. 

Just so, I think it is with our nonrigid inquiries. Consider the 
reading of ‘The President of the U.S. need not be male’ which we 
defend by pointing to a world in which Elizabeth Dole is the Presi- 
dent. When asked whether this use is rigid—Does the same entity 
figure in all the relevant situations?—we answer “No,” since it is 
Bush who is relevant in the actual world, and Libby Dole in the 
world that proves the claim. Does this mean we accept, or would 
accept if asked, a privileged ontology in which ‘The President of the 
U.S.” does not appear? No more, I think, than ignoring the cookie 
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crumbs when counting the things on the table implies a similar re- 
jection of cookie crumbs. In the context, we naturally interpret the 
question as ‘Does the same person always appear’ ?—indeed, in prac- 
tice, the discussion often takes place in these terms: ‘The referent is 
not always the same person’. And to this, the answer is clear. This is 
also why collections of molecules do not figure in our intuitive rea- 
soning. (The question of rigidity, however, hinges on sameness of 
entity; so this finding, however clear, does not even suffice to show 
that in this sentence the expression is used nonrigidly, much less 
that a restrictive ontology is indicated.) Thus, I am not at all con- 
vinced that the fact that people can be led so easily to recognize 
definite descriptions as nonrigid shows that, at any level, they accept 
a privileged ontology. It only shows that we are accustomed to speak 
of a certain subclass of entities—even to the exclusion of other 
entities that we clearly do recognize. 

These reflections upon why we should not make too much onto- 
logical hay of our ordinary judgments of rigidity—or other judg- 
ments that may play on our “default” settings for general variables 
—also help explain why we are so naturally led to a metaphysical 
interpretation of rigidity and, ultimately, to the new metaphysics. 
Our ordinary practices—our “default” settings—have us speak, 
and perhaps believe, ‘as if’ we accepted a privileged ontology. Un- 
less these conventions are somehow flagged, we make our judg- 
ments within these practices, and may, without realizing it, incur 
more blatantly metaphysical commitments—in this case, to a privi- 
leged ontology. 

Now, part of the great elegance and beauty of Kripke’s work— 
especially his discussion of rigidity—is that we are explicitly asked 
not to “think as philosophers,” but to offer our common-sense 
judgments about how we use names and descriptions. Thus, in get- 
ting us to see the rigid/nonrigid distinction, and that names, but not 
all descriptions, are rigid, no ‘metaphysical flags” are raised. One 
would have to be predisposed toward ontological axe-grinding to 
think at that point that the question ‘Does ‘The President of the 
U.S.” always refer to the same thing’? was a “loaded” question. We 
have not yet incurred a privileged ontology, but our attention is 
firmly fixed on “usual” entities (people), and not at all on “deeper” 
ontological questions. As we move on to the metaphysics, and 
wonder how rigid expressions work, we still avoid the relevant sort 
of metaphysical flags. Names, we are told, are introduced by osten- 
sion or description, and are then used rigidly for ‘that thing’—what- 
ever we have ostended or described; ditto for rigidly used descrip- 
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tions. Now, if we are on our ontological guard, we may at this point 
complain that ostension or simple description will not suffice to pick 
out (determinately) a single, modally extended object. But there is 
no indication yet, nothing really to warn us, that we should be on 
such guard. Ostension is a perfectly normal practice; we know how, 
without needing to make a fuss, to identify ‘that thing’. This could 
be because we accept a privileged ontology, but it could as easily be 
because we have default settings here which we use all the time— 
without, again, even noticing it. And by now, without having had to 
confront explicitly any metaphysical questions—just working uncriti- 
cally within ordinary linguistic practices—we are positioned to start 
asking modal questions about ‘this guy’, or whatever our object of 
reference may be. Without needing to worry about how we have 
done it, we see ourselves latched onto an object of reference, and 
ready to ask what its possibilities are. And if the semantic structure 
of referring expressions is as I shall suggest, we shall find necessary a 
posteriori truths. If I am right, these truths depend, for their modal 
status, on our conventions, on conditions we ‘“‘in default” associate 
with the expressions. But there is no reason within the inquiry for it 
ever to occur to us that this might be the case. Once we are onto the 
rigid/nonrigid distinction, our attention is firmly focused upon ob- 
jects, not language. 

Thus, even if our referring expressions have the sort of semantic 
structure I suggest, it is perfectly simple and natural for us to go 
from the initial presentation of the rigid/nonrigid distinction all the 
way through to (1)}{(5) without ever having to reflect upon our ontol- 
ogy, to address explicitly the question of restrictivism, privileged 
ontology, and permissivism. But since in our ordinary practice we 
speak ‘‘as if” we accepted a privileged ontology (whether we do or 
not), it is no big surprise that we can wind up in the position I have 
urged a defender of (1)+5) is in—that of really being committed to 
a privileged ontology. The ‘‘as if’ character of the linguistic prac- 
tices combines with the absence of metaphysical flags raised during 
the investigation, so we in effect assume a privileged ontology in our 
interpretation of rigidity, and feel unproblematically determinately 
focused on “usual” modally extended entities.” 

Behaving and speaking as if something is true may not amount to 
assuming or presupposing it. But when this is used—without argu- 
ments for its truth—in arguments which explicitly commit us to the 

© Of course, some may explicitly accept a privileged ontology beforehand. But 


the ontology sill drives the metaphysics, and requires independent defense for 
the new metaphysics to stand. 
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view in question, it may more fairly be said that we have presup- 
posed, or accepted a commitment to it in the course of the argument 
—and not just as a result of our conclusion. This is what I mean by 
saying that a privileged ontology is assumed or presupposed in the 
acceptance of MRD and (1)}+5), and why I am inclined to say that a 
defense of a privileged ontology is required not only because it is a 
commitment of MRD and (1)-(5), but because its prior (tacit) accep- 
tance led to these positions. 

The new metaphysics has “seemed” to arise out of mostly linguis- 
tic considerations. But it has in fact arisen out of prior acceptance of 
a privileged ontology disguised in linguistic garb. This metaphysical 
view needs to be evaluated against its competitors with its own face. 
Privileged ontology is not a commitment of the new metaphysics—it 
is the new metaphysics. And whatever arguments may be available in 


their support, the phenomena of rigidity are not among them. 
VI SKETCH OF A SEMANTIC ACCOUNT OF RIGIDITY 


I conclude with a sketch of an ontologically neutral account of rigid- 
ity, which I shall call a semantic account, to contrast it with the 
metaphysical account we have been considering. It is semantic in 
that counterfactual reference and rigidity (or not) are determined, 
more or less (see fn. 26), by conditions analytically associated with 
referring expressions; so if this account is correct, MRD is false, 
undermining (15), and the metaphysical view expressed therein. 
Aside from seeing how these phenomena can occur within nonprivi- 
leged ontologies, I hope it will be welcome just to see a semantic 
treatment presented, as the metaphysical one is usually just 
assumed. 

We begin by considering how there could be determinacy of rigid 
reference with a permissive ontology, starting again with descrip- 
tions. Earlier, we saw that on such an ontology, we would need to 
associate criteria of identity with “The President of the U.S.’ for it to 
determinately refer to either Bush or Prez, since its “surface” con- 
tent applies equally to both. If we also allow such associated criteria 
to determine the reference of the description in other worlds, then 
we have both determinate actual reference, and rigidity as well. For 
the description then applies, in any world, to whatever satisfies the 
criterion of identity—and, so, to that which is identical to the actual 
referent—in that world. Determinacy and rigidity are handled in 
one fell swoop. As we noted, this has the consequence that counter- 
factual reference is determined by semantic conditions associated 
with the term, not just by ‘identity with the actual referent’—for we 
do not kave an actual referent until we have settled on identity 
conditions. But we are not here concerned to preserve MRD; in- 
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deed, our semantic account is supposed to highlight how MRD can 
fail to be the correct account of rigidity, and how it does fail in 
nonprivileged ontologies. 

One might think that, if we fail to associate the criterion of iden- 
tity for persons with “The President of the U.S.’, the description is 
not rigid, but refers nonrigidly to Bush. This, however, ignores the 
fact that, on our specification, ‘being President’ is associated as a 
criterion of identity (not just of application), so the referent is Prez, 
even in worlds where Libby Dole is the President (and so we have 
rigidity). A more serious worry is that it may now seem impossible— 
as we earlier worried it might be—to have nonrigid descriptions. If 
determinacy of reference requires that we associate a criterion of 
identity with a description, and this criterion determines its counter- 
factual application, the referents will always be identical, so all de- 
scriptions will be rigid. The only question is which entity it rigidly 
designates. 

The supposition that an associated criterion of identity must also 
serve as the counterfactual criterion of application for a term is 
mistaken. While this is typical as such matters are usually discussed, 

there is no reason why we might not associate, with a single expres- 
- gion, one condition as the criterion of identity, and a different one 
as its criterion of counterfactual application. The two, after all, per- 
form different functions. Criteria of identity tell us under what con- 
ditions we have the same object as the actual referent. Criteria of 
application tell us under what conditions the expression in question 
applies in another world. I suggest that we need to distinguish the 
two carefully, and that descriptions (and ultimately, all referring 
expressions) need to be associated with both criteria of application 
for other worlds and criteria of identity. 

Criteria of identity serve not (of themselves) to determine coun- 
terfactual reference, but actual reference (by distinguishing among 
the candidates satisfying the ‘‘surface” criteria of application, e.g., 
‘is the President’); criteria of application determine counterfactual 
reference. When these are the same, as we have seen, the descrip- 
tions will be rigid—even when the criterion of identity is “‘un- 
usual.”?! But the criteria need not be the same; we can associate the 
identity conditions for people with “The President’—making Bush 
the actual referent—while using the familiar descriptive content as 
our criterion of counterfactual application. This results m—and is, I 
suggest, how we get—nonrigid descriptions: that to which the ex- 


2A , it 1 eno if the critena entail each other. This may be a better 
picture of rigidity of complex expressions for numbers. ads 
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pression applies in a world can differ from that which satisfies the 
criterion of identity.” 

The associated criteria need not be explicitly formulated. Typi- 
cally, we recover them from the sorts of counterfactual situations 
one counts for or against one’s statements using these descriptions. 
As we may postulate an intention to refer rigidly, and so, at least 
implicitly, through identity, when someone does not count the possi- 
bility of a Libby Dole election against their utterance of “The Presi- 
dent could not have been a woman’, so we may postulate a more 
detailed intention if this is needed to get the reference of their 
expression to be Bush, rather than Prez. Similarly, our judgments of 
nonrigidity show that we intend the counterfactual application of 
the term to be given by conditions other than the criterion of iden- 
tity (though criteria of identity still need to be associated to make the 
actual referent Bush rather than Prez), and license a similar postu- 
lation. 

Here is a representation of the semantics of descriptions (‘CA’ 
= criterion of application; ‘CI’ = criterion of identity; ‘@’ stands for 
the actual world; ‘w,’ for other worlds; ‘(v) ’ stands for the value of 
conditions ‘ ’) [this is a first approximation]: 


(1) Rigid descriptions (normal) (e.g., “The President’ rigidly designat- 
ing Bush):” 
CA@ = Being the President (one might spell this out further) 
CAw, = (say) Having the same origin (in w,) as (v}CA@ 
CI = Having the same origin as (v)CA@ 
(2) Rigid descriptions (unusual) (e.g., ‘The President’ rigidly designat- 
ing Prez) 
CA@ = Being the President 
CAw, = Being the President 
CI = Being the President™ 
(3) Nonrigid descriptions (‘The President’ refers nonrigidly to Bush) 
CA@ = Being the President 
CAw, = Being the President 
CI = (say) Having the same origin as (v) CA@ 


™ The question of why we might ever give these criteria different conditions is 
an interesting one, and there are interesting things to be said about why in prac- 
tice philoso tend to run them together, but space limitations preclude a 
fuller discussion here. 

™ If (as I thmk) Kerth Donnellan’s distinction 1s genuinely semantic, the actual 
criteria for “referential” descriptions would be given not by the “straight” con- 
tent of the description, but by causal conditions (see the account of names below), 
or some other “intended” content. 

™ I realize this does not really have the normal form of a criterion of identity— 
‘x bears R to y’. ee eee meses 


_ to be specified n 
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The “surface” sense of a description gives the parameters of the 
actual conditions of application, the criteria of application for other 
worlds say what conditions something must meet in another world 
to be the referent of the expression in that world, and the criteria of 
identity specify the conditions something in another world must 
meet to be identical to the actual referent—and thus also determine 
the actual referent.” Note that on the usual treatment, we would fill 
in ‘CI’, for rigid descriptions, with “Being identical to the actual 
value of the description.” But in a permissive ontology, this will not 
suffice—for it is not determined whether it is identity with Bush or 
Prez that is indicated. The identity conditions must at least be speci- 
fied according to the kind of thing in question, if not more finely.* 

In normal rigid descriptions, we let the criteria of identity serve to 
determine the counterfactual criteria of application. This is the ker- 
nel of truth, in a permissive ontology, in the idea that we can refer 
through identity: we can refer through criteria of identity, so it is 
“built in” semantically that the expression has the same reference in 
all worlds. To this extent, descriptions can be rigid de jure (see fn. 
13).? But since descriptive conditions, rather than the metaphysics 
of identity determine the reference, MRD fails here. 

This, in brief sketch, gives our semantic account of rigidity. The 
rules of reference, for both rigid and nonrigid descriptions, are 


3 Criteria of counterfactual poan mentioning (v}CA@ do not quite just 
plug in the actual value of the ipuons (for die CAG do nota themselves 
determine an actual value), but are more like functions from the “actual” content 
and identity conditions to objects in other worlds—and the criteria for the actual 
world may be seen either as providing ambiguous actual reference, or better (at 
least usually), as functons from “surface” content and criteria of identity to a 
referent. This is why the above sketch is a “first approxmation.” Cntena of 
identity could thus be considered as part of the criteria of actual application, but 
it is useful to distinguish their contributions. Note that criteria of identity play the 
same role in getting determinate reference in other worlds. In theory, there 1s 
room for distinct criteria of identity for the actual and souihitertaccial vehercata 
(when CAw, is not given by CI). But Ì am not aware of any expressions that 

function this way 

*® I mean here to reemphasize that the criteria need not specify the fea- 
tures counterfactual referents of rigid terms must have (see fn. 7)—{1) above 
leaves open what the origin is, and so what it is in other worlds (though not that it 
is the same in all worlds)—thus allowing for necessary a posteriori truths (‘Bush’s- - 
origin is ’). But it would be misleading to speak of this as the empirical 
discovery of essence, for what is empirical is just the value of parameters, the 
Gsenualley of which i dét by the gige: and a Mon. 

Y Within this framework, then, the de jure/de facto distinction becomes 
blurred, or imperspicuous. If we identify rigid de jure expressions as those which 
are descriptionless, as in MRD, then there are no such ressions. If we i 
only that rigidity be semantically built in, there is no rigidity de facto. 
Perhaps the intuitive distinction divides terms where the criteria are the same 
from those which imply each other. 
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given semantically, not metaphysically. Because of this, the account 
can also work within a restrictive ontology. In this case, one might 
say that it is not the metaphysics of identity which determine the 
counterfactual referents of rigid terms, but rather, the semantics of 
rigidity which determine what is to count as identical across worlds. 
And while in a privileged ontology, rigidity can be determined meta- 
physically, there is nothing to rule out the semantic story being just 
as I have outlined it. Thus, the semantic account of rigid designation 
is neutral with respect to our three “worldviews,” while the ‘‘meta- 
physical” account can work only within a privileged ontology. This, I 
believe, provides some ground to prefer my treatment whatever 
one’s metaphysics—for while the metaphysical questions are mat- 
ters for dispute, it hardly seems a real matter of dispute whether we 
can and do use expressions rigidly. 

Finally, back to names. For a name to be rigid, it must be asso- 
ciated with a criterion of identity across worlds, and this then serves 
as the criterion of application for the name, viz: 

(4) Names (‘Bush’): 

CA@ = Description/ostension/causal relation 

CAw, = Whatever satisfies CI in w, 

CI = (say) Having the same origm as (v}CA@ 
Names, then, like rigid descriptions, are rigid not by lacking descrip- 
tive content, but because their criteria of application are the same as 
(given by) their associated criteria of identity.. This allows us, in 
theory anyway, to identify a possible nonrigid use of names in which 
these two are separated (‘Superman’ or ‘Mr. President’ are possible 
examples). We may think of such names as disguised descriptions; 
the point is just that nothing prevents ‘‘ordinary names” —linguistic 
items with no surface syntactic or semantic structure—being non- 
rigid. We simply do not ordinarily so use them. 

To restate my main contention: once we distinguish the phenome- 
non of rigidity from its metaphysical interpretation, and see that the 
latter hinges on just the metaphysical view the phenomena are 
thought to support, we can refocus the debate at the metaphysical 
level, where it belongs. The “new essentialism”’ is nothing more nor 
less than advocacy of a privileged ontology, and needs to be de- 
fended as such—as a metaphysical view that neither flows from nor 
is supported by the fact that some, but not all expressions are rigid. 
By now, it is probably clear that I am rather skeptical about the 
possibility of a successful defense of this view. But arguments await 


another occasion. 
ALAN SIDELLE 
University of Wisconsin/Madison 
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ompare the single-event interpretations of 
(1) Sly Pete will win. 
(2) ‘Sly Pete wins. 
(3) Sly Pete won. 


Call them m,, mg, and ms, respectively. It is popularly presumed 
that these three differ only in the times they assign to Sly Pete’s win, 
and more particularly that m, formulates about the future exactly 
what m, and m, formulate about the present and past—whatever 
that might be. I shall demonstrate that this presumption cannot 
be right. 

My argument is premised on a theory and a fact. The theory is 
about how reported speech works. The fact is about what English 
speakers say to report an affirmation of m,—or rather what they do 
not say. To keep the two premises separate, I shall use different 
examples when doing the theory, beginning with the single-state 
interpretation m, of ‘I am awake’. 

If a speaker X affirms m, on Monday, then as a matter of observa- 
tion a different English speaker Y will report this on Tuesday by 
saying ‘X said she was awake’. The single-state interpretation m, of 
the embedded sentence ‘she was awake’ is of course a different 
message from m4, the latter attributing present wakefulness to who- 
ever the speaker is, the former past wakefulness to a third person. 
But each provides its speaker with means of referring to the same 
matter of historical fact. ‘I’ is Xs way, and ‘she’ Y's, of identifying X 
as the person whose wakefulness at X’s moment of utterance is lo- 
cated as present by m, and past by m,. That then is how reported 
speech is meant to work: the aim is to preserve objective substance 
in the face of changed circumstances of transmission.! 

Taken together with the presumption that m, merely formulates 
about the future what m, formulates about the past, this theory 
entails that, if X affirms m, regarding the coming evening, Y ought 
to report this next day by saying ‘X said Sly Pete won.’ For with the 
relevant evening future with respect to X’s point of speech and past 
with respect to Y's, m, achieves exactly the objective substance for Y 
that m, achieved ex hypothesi for X. But as a matter of observation 
this is not so. Therefore the presumption is false; q.e.d. 

‘In particular, location in the A-series is adj to preserve location in the 
B-series. See J.M.E. McT “The Unreality of Tıme,” Mind, xvu (1908): 
457-84, esp. 458. This is all that matters for my demonstration 
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If this argument works at all it works for any natural language. 
For instance, we are standardly taught that ‘vincera’ and ‘vinse’ are, 
respectively, a ‘‘future-tense’”’ and a past-tense form of the Italian 
verb to win. But if the guiding principle of Italian reported speech is 
to preserve objective substance—an issue concerning which the evi- 
dence is there to be culled—then we must resist the natural implica- 
tion that the only difference in meaning between the two forms 
concerns the time of the win, because as a matter of observation 
‘vincera’ is reported after the event by something quite other than 
‘vinse’, namely, ‘avrebbe vinto’. 

The argument applies in narrower cases as well. Consider the 
single-events interpretations mę and m, of 


(6) If Sly Pete calls he will win. 
(7) If Sly Pete called he won. 


Observation of how English speakers report prior affirmations of m, 
reveals the impossibility that mg might merely formulate about the 
future what m, formulates about the past. Yet every theory of “‘con- 
ditionals” I know seems to presume exactly that, recommending the 
very same semantic treatment of each. 

A swifter refutation. What Y in fact says is ‘X said Sly Pete would 
win’, or ‘X said that if Sly Pete called he would win’.? But according 
to our popular presumption, the function of ‘will’ in (1) is to locate 
Sly Pete’s win directly in the future, just as ‘won’ in (3) locates it 
directly in the past. And nothing ‘would’ could possibly do could 
match the sheer simplicity of this arrangement. The popular pre- 
sumption leaves it inexplicable why Y should resort to ‘would’ to 
report Xs ‘will’. 

V. H. DUDMAN 
Macquarie University 


? Elsewhere I have advanced a theory that can explain why this should be so— 
most recently in parts 4 and 5 of “Interpretations of ‘If-sentences,” in Condi- 
ttonals, Frank Jackson, ed. (New York: Oxford, 1991), pp. 202-32. But here my 
purpose is wholly negative. 
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Wittgenstein’s Philosophy of Psychology. MALCOLM BUDD. New York: 
Routledge, 1989. xiv + 186 p. Cloth $39.95, paper $14.95. 


Sapiently gauging his distance, Malcolm Budd offers us a synoptic 
view of Wittgenstein’s later thoughts on the mind. His treatment is 
selective yet representative. Seven dense chapters survey Wittgen- 
stein’s various surveys of meaning and understanding, sensations 
and sense impressions, the seeing of aspects, images, internal speech 
and calculation, thought and intention, and feelings, emotions, and 
bodily awareness. Budd’s expositions are in every way exemplary. 
He is deeply sympathetic to his subject but by no means uncritical. I 
know of no other work on Wittgenstein that can match his for clar- 
ity, accuracy, concision, and penetration (and there are some fine 
books out there). He knows exactly what he is talking about and is 
able to convey this to the reader with unostentatious efficiency. His 
book would be perfect for students still trying to find their feet with 
Wittgenstein, but it can be heartily recommended to scholars who 
think they know their way around the Austrian’s later thought. Here 
is a book on Wittgenstein that you can trust. 

Beginning with a chapter on method, Budd emphasizes the pro- 
phylactic character of Wittgenstein’s intentions: the notions of 
grammar, language game, and perspicuous representation are to be 
seen in a curative light. He brings out the consequent radical dis- 
juncture Wittgenstein perceived between our ordinary mastery of 
psychological concepts and our bewilderment once we reflect upon 
this mastery: we go from genius to dolt simply by taking a downward 
glance at our concepts; we simultaneously grasp and do not grasp 
what we mean. Budd is sensitive to the apparent paradox this pre- 
sents and does his best to explain how Wittgenstein lived with it. I 
feel that this issue could stand further work: Did Wittgenstein really 
have an adequate account of the depth of error he attributed to our 
reflective understanding, giveri that he took our ground-floor con- 
ceptual grasp to be essentially flawless? 

Chapter II sets out Wittgenstein’s position on meaning, rules, 
conscious processes, mental mechanisms, capacities, reasons, and 
communication: These topics have been hotly contested in recent 
philosophy, stimulated largely by Saul Kripke’s “community inter- 
pretation” of Wittgenstein’s position. Budd shows definitively, I 
think, that this misrepresents the essence of Wittgenstein’s view; he 
stresses instead the contrast between inner and outer. The main 
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target of Wittgenstein’s critique is the idea that meaning is a quality 
of consciousness—something that happens in you. 

Wittgenstein is surely right about this, but it does not follow that 
the concepts of meaning and consciousness are unconnected, nor 
that the latter is not a necessary condition of the former. Meaning 
can be (and be essentially) something we do consciously without 
thereby consisting in an introspectible quality of consciousness. 
Wittgenstein seems not to have explored other less direct ways in 
which meaning and consciousness might intersect. 

Budd’s presentation of the private-language argument is impres- 
sively clear and thorough. Since first-person ascriptions are criter- 
ionless, private sensations could not afford the constraint on correct 
use which governs our actual practice of ascribing behaviorally ex- 
pressed sensations. The would-be private linguist is thus bereft of 
any means of establishing a reference relation. Budd appears to 
accept this argument, while plausibly criticizing Wittgenstein for ex- 
truding causality from his account of sensation concepts. I would 
say that the introduction of a causal relation between sensations and 
self-ascriptions provides one way of resisting the argument. We 
should, first, remind ourselves of how surprising the conclusion of 
Wittgenstein’s argument ought to seem. He invites us to believe that 
someone whose sensations happen to have no behavioral expression 
is semantically impotent in respect of those sensations. The subject 
has the sensations, she can speak a language for public objects, but 
somehow she just cannot get her words to refer to her sensations. 
Only if her body offers up potential criteria for others to detect her 
sensations can she succeed in homing in semantically on those sen- 
sations. Suppose the sensations are publicly undetectable up to time 
t and then acquire behavioral manifestations at ¢, only to revert to 
privacy ten minutes later. According to Wittgenstein, the subject 
could not refer to her sensations before t, though she can once her 
body starts to show their presence to others; and when the behav- 
ioral manifestations go her sensation words lapse back into mere 
empty sounds. Surely this is a remarkable result: it ties my semantic 
abilities with respect to my own mental life to the epistemic powers 
of others in detecting what it is I am experiencing. 

The suspicion that some form of verificationism underlies Witt- 
genstein’s argument is reinforced if we consider what a causal 
theory of reference might have to say about the possibility of a 
private lexicon, since such theories do not locate reference in the 
speaker’s ability to check that he is referring correctly—only in the 
objective obtaining of causal relations between objects and uses 
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(think, say, of Jerry Fodor’s asymmetric dependence theory). Since 
Budd accepts that sensations cause self-ascriptions, he is not pre- 
cluded from considering a theory of this kind. And if a causal theory 
can be applied to sensation words, we have this result: so long as the 
private objects bear the appropriate reference-conferring causal re- 
lations to first-person uses, that will be sufficient to underpin a 
semantic relation—no matter how private the sensations are. In 
other words, causal theories impose reference conditions that are 
inclusive enough to permit a private language (the same could be 
said of teleological theories). For such theories do not build in any 
necessity for anyone to be able to verify that words are being used as 
their semantics demands; it simply has to be the case that tokens of 
pain’ (say) appropriately covary with the occurrence of pains in the 
speaker. These theories may or may not be adequate, but we need to 
consider them if we are interested in whether the privacy of a sensa- 
tion thwarts reference to that sensation. 

Nor should we let the general mysteriousness of the reference 
relation induce us to suppose that reference is uniquely impossible 
where private objects are concerned, since that would be to pin the 
blame on the wrong thing. Maybe we cannot really explain reference 
in any context; it is just that this becomes more obvious when we 
strip reference down to its purest form. In any event, the private- 
language argument seems to me a good deal more questionable than 
it seems to Budd. 

The chapter on aspect perception is a subtle and lucid discussion 
of the way in which Wittgenstein demonstrates the “polymorphous 
character” of the concept of seeing. Seeing an aspect is neither 
quite like seeing color and shape, but nor is it a case of interpreting 
what one strictly sees. The fact is that it lies between sensation and 
thought and cannot be assimilated to either. The phenomenon thus 
serves to expose the simplifying errors inherent in both empiricist 
and rationalist views of mind. Our mental concepts are much less 
monolithic than philosophers have supposed. 

On the distinction between visualizing and seeing, Budd ably ex- 
pounds Wittgenstein’s idea that the difference consists essentially in 
subjection to the will. One’s attitude in visualizing is active, while in 
seeing one is a passive observer. The difference is not a matter of 
experiential content or quality. This is held, plausibly, to explain 
what may appear anomalous: that one cannot see and visualize the 
same thing at the same time, though one can do this for different 
things—since one cannot be active and passive toward the same 
experiential content simultaneously. 
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Wittgenstein’s opposition to the idea of an inner vehicle of 
thought is shown to be an instance of his wider point that nothing of 
an intrinsically nonrepresentational nature could ever be the basis 
of one’s knowledge of the content of one’s thought, so that the 
putative inner vehicle could play no epistemic role in an account of 
self-knowledge of intentional states. I would agree with this point 
but note that it does not follow that the inner vehicle might not be 
needed for other purposes. Indeed, given Budd’s earlier insistence 
on the causal properties of sensations, he might regard with sympa- 
thy contemporary thinkers (like Fodor) who maintain that no ac- 
count of intentional causation is possible without introducing an 
inner vehicle whose structure encodes the causal powers of thought. 

The book concludes with a fresh discussion of the relation be- 
tween emotion and proprioception, on the one hand, and somatic 
sensation, on the other. As we might expect by now, Wittgenstein’s 
main target is the idea that bodily feelings are constitutive of emo- 
tion or proprioception. Such feelings accompany one’s fear or one’s 
awareness of the position of one’s arm, but the concepts are not 
concepts of such feelings—rather as certain sensations accompany 
linguistic understanding without constituting it. Nor do we know 
our emotional state or bodily position by means of our awareness of 
the accompanying sensations. This explains something that might 
otherwise seem puzzling: how a horrible or agreeable emotion can 
be so much more powerful than the somatic perturbations that go 
with it. Affective tone is more a matter of the intentionality of the 
emotion than its bodily feel. 

In the preface to his book, Budd makes a nice acknowledgement 
to me, adding that “he has not seen what I have written and is 
unlikely to agree with all of it.” Well, now I have seen it and agree 
with far more of it than might be thought proper in a reviewer (and 
not merely because we are friends). The book is a model of Wittgen- 
stein exegesis. Only on the soundness of the private-language argu- 
ment do we seriously disagree: he thinks it works and I do not. 

COLIN McGINN 
Rutgers University 


Locke: Volume I, Epistemology; Volume IT, Ontology. MICHAEL AYERS. 
New York: Routledge, 1991. x + 341 p. and vi + 341 p. Cloth 
$127.50 (set). 


This book is something of a tour de force. Lf, as I understand, it has 
been a long time in the writing, the time was well worth taking. Over 
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its very considerable length it sustains (and deserves in response) an 
unflaggingly high level of serious thought. The bare simplicity of its 
title belies its complexity and richness. In an introduction that dis- 
cusses the relation of philosophy to its history, Ayers says that, being 
intended primarily for philosophers, the book “is probably not his- 
torical enough” in its purposes to suit intellectual historians, but that 
the result “is probably too historical to please many philosophers” 
(I, 3-4). This need not mean that neither historians nor philo- 
sophers could be satisfied by it, for besides the chapters on Locke’s 
philosophical antecedents and surroundings (very useful to the stu- 
dent of any seventeenth-century philosopher), and those on details 
of Locke’s own arguments, there are long stretches of pure and 
unadulterated philosophy. What it unfortunately may mean is that 
the material of this latter kind may not only be passed over by intel- 
lectual historians, but also missed by philosophers who might never 
dream of picking up a book called “Locke.” Although it manages 
nicely to combine an analytic approach to “the arguments of the 
philosophers” (the title of the series for which it was written) with 
some very illuminating history, much of the philosophy in it is stimu- 
lated by, rather than done along with Locke. 

Ayers’s aim is to “recover the meaning” (II, 1) of Locke’s Essay, 
and to assess what emerges by comparison with relevant historical 
rivals and also “current” theories. It is wry understatement to say 
that this comparison is “not always to the advantage of the present” 
(II, 1). A very valuable, and certainly one of the most characteristic 
and striking things about the book, is the extent to and manner in 
which Ayers comes to the defense of Locke. This is not a simple 
matter of straightforward sympathy as might be expressed in an 
attempt to explain, from an historically enlightened perspective, 
how Locke could have come to say things that might now seem 
patently unacceptable. It is, rather, a matter of Ayers’s believing 
(and this is part of his defense of doing history of philosophy at all) 
` that we can learn from the past—not, let it be stressed, by way of 
identifying mistakes to avoid repeating, but by way of recovering lost 
truths. “The past is an invaluable repository of unfashionable truth 
for those with the sympathy, judgement and patience to discern and 
make use of it” (I, 9). 

In effect, each of the two volumes has as its theme a truth of 
which recent philosophy has lost sight, and which the study of Locke 
helps us to regain. A continuing theme of the first volume is the 
rightness of Locke’s theory of knowledge to ascribe a central role to 
subjectivity or consciousness. Along with what he finds in Locke, 
Ayers argues that perceptual beliefs have to be understood as being 


438 THE JOURNAL OF PHILOSOPHY 


such that their source and objects are present to consciousness in a 
direct and perspicuous manner. This is not to say that they are 
infallible, and any suggestion of Locke’s that they are ‘‘evident’’ 
needs carefully to be detached from this. Yet it is, says Ayers, a 
perfectly reasonable rejection of the idea of guaranteed knowledge 
deliverance which is partly responsible for subjectivity’s being left 
out of modern accounts of knowledge. ‘The chief, and for some the 
only, argument for accepting the phenomenon of consciousness has 
been its obvious existence, rather than its indespensability for ex- 
plaining the possibility of knowledge” (I, 193). The idea of primary 
and direct knowledge that has an “evidence” or “‘perspicuousness”’ 
leads to a defense of ‘‘a qualified form” (I, 268) of Locke’s intuition- 
ism about a priori knowledge, with its talk of “seeing” necessary 
connections between ideas. It leads also to a criticism of twentieth- 
century “‘linguistic” accounts of such knowledge. 

The continuing theme of the second volume (which has already 
made itself felt in the background in the first) is a wholehearted and 
energetic rejection of “linguistic idealism” or “conceptual relativ- 
ism.’’ Ayers argues that language- and concept-independent reality 
itself “gives” us distinct objects. He argues for a strong and objec- 
tive contrast, of a sort which, he says, Locke “‘accepted, stated, and 
defended” between ‘‘substances, which are the real and natural uni- 
ties, and non-substances which are in some sense abstractions or 
creatures of the mind” (II, 10). One the treasures of this book is 
Ayers’s long and detailed, intellectually very rich and sensitive, ex- 
ploration and development of this distinction between ‘‘substances”’ 
and ‘‘modes.” 

None of this is to say that Locke got it all right. What Ayers 
develops and defends are theses “‘more or less analogous” (II, 2) to 
Locke’s. Locke wrongly assumed that the perspicuity of primary 
knowledge provides a criterion of truth, and the twentieth century is 
right to reject that assumption. What is needed is “authoritative 
perpiciousness without infallibility’’ (II, 3). Moreover, Locke’s 
causal theory of representation robs experience of any intentiona- 
lity. Modern thought is wrong to approve this in Locke, wrong not 
to see that it is conscious thought, and not language, or behavior, or 
functional states of the brain, which is the bearer of intentionality. 
For Locke, sensations are “blank effects” that represent things only 
in virtue of being caused by them; whereas, argues Ayers, they are 
conscious states which have an intentional content. Only the “in- 
trinsic intentionality of sensation, can explain the essential epistemo- 
logical role of sense-experience or make it seem anything but 
an accident that sensations occur in the process which gives 
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rise to perceptual knowledge” (I, 172). Furthermore, Locke’s quasi- 
perceptual intuitionism about necessary truth fails to explain our 
apprehension of modality. 

As for the objective distinction between the categories of sub- 
stance and mode, Locke does not always deal with it property. So, 
far from anticipating Saul Kripke, his account of their names 
wrongly supposes that the belief that substances have real essences is _ 
“semantically inert” (II, 78). He failed also in not giving due theoret- 
ical weight to the category of “natural modes’’—a category that is 
produced by a natural/nonnatural cut across the substance/mode 
distinction. (This category includes natural but nonsubstantial . 
things, such as measles or photosynthesis, as opposed to Locke’s 
nonnatural modes, such as triangle or property; and it is one about 
which Ayers is most interesting.) This latter failure of Locke’s stems 
from his trying to account for the “natural unity” of substances as 
opposed to the mind-given unity of modes, in terms of a principle of 
causality—a unifying causality which, evidently, is there with natural 
modes, too. Though “not entirely mistaken” (II, 128), this causal 
unity is more superficial than what is required, namely, a conception 
of material unity. We need, Ayers argues, a serious idea of the mate- 
rial coherence of a lump of stuff—an idea that Locke conflated with 
the different idea of quantity of stuff with the result that, along with 
more recent philosophy, he wrongly supposed that entities could be 
constructed by fiat by the arbitrary addition of principles of identity. 
Against their downgrading and underestimation by philosophers of 
all ages, Ayers defends materially unified chunks, such as rocks, as 
crucially important ontological items. 

One feature of the way in which, having recovered his meaning, 
Ayers wishes to assess Locke, is to take his arguments “not just one 
by one, but as a whole,” and to make a response which is as “system- 
atic as the philosophy under consideration” (I, 5—6). He does not, 
then, attempt to repeat or explain, either in detail or in general, the 
surface structure of the Essay as it presents itself to someone less 
knowledgeable than Ayers. From the very outset, he clearly assumes 
his readers will already be rather familiar with the text. The discus- 
sion is usually so far below its surface as to be rather unsystematic in 
terms of the Essay’s books, chapters, and sections. A somewhat star- 
tling symptom of this is that the last ten or twelve percent of the 
whole is devoted to Locke’s chapter on identity. This is a chapter 
that was unknown in the Essay as first published, and it is, Ayers has 
to concede, in some respects “an awkward afterthought.” But in 
other respects, he says, it is “a natural culmination of Locke’s sys- 
tem” (II, 205); and his discussion of it does indeed come, not merely 
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at the end of, but as a climax to, the whole work. It has to be said 
that in the light of Ayers’s endeavors to read Locke “systematically” 
one can see how it would be possible to be a “‘Lockean” in the way it 
always has been to be a “Kantian.” 

Throughout the book there is little dialogue with recent writers 
on Locke. On more than one occasion, “some commentators” are 
mentioned without being identified or located. No doubt this has 
something to do with the fact that Ayers is not taking Locke’s argu- 
ments “one by one.” But it unfortunately harmonizes with the sense 
of a mannered disdain which is produced by the provocative lofti- 
ness of the all too frequent dismissive references to the “fashion- 
able,” “the currently popular,” “the latest thing.” These references 
go from being annoying to being ludicrously parochial when, having 
found someone whose thought has recently converged with his, 
Ayers comments, on the very last page: “it must be the Oxford air, 
although not every argument out of Oxford is as cogent as this one 
seems to be” (II, 238). 
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RATIONALITY WITHIN REASON* 


uppose I need to decide whether to go off to fight for a cause 

in which I deeply believe, or stay home with a family which 

. needs me and which I deeply love. What should I do? My 

friends say I should determine the possible outcomes of the two 

proposed courses of action, assign probabilities and numerical utili- 

ties to each possibility, multiply through, and then choose whichever 
alternative has the highest number. 

My friends are wrong. Their proposal would be plausible in games 
of chance where information on probabilities and monetarily de- 
nominated utilities is readily available. In the present case, however, 
I can only guess at the possible outcomes of either course of action. 
Nor do I know their probabilities, or how to gauge their utilities. 
The strategy of maximizing expected utility is out of the question, 
for employing it requires information that I do not have. 

Nevertheless, my friends have not given up trying to help, and so 
they point out that I could simulate the process of maximizing ex- 
pected utility by assuming a set of possible outcomes, estimating ` 
their probabilities, and then making educated guesses about how 
much utility they would have. I could indeed do this, but I decide 
not to, for it occurs to me that I have no reason to trust the formula 
for maximizing expected utility when I have nothing but question 
marks to plug into it. Better strategies are available. Explaining what 


they are is the purpose of this paper.’ 


* I thank Simon Blackburn, Jim Child, Jules Coleman, Tyler Cowen, Peter Dan- 
ielson, Walter Glannon, Mark Kingwell, Ruth Marcus, Mark Migotti, Christo 
Morris, Jan Narveson, Alan Nelson, Philip Petut, Ken Presting, Mark Ravizza, 
Mike Resnik, Lamie and John Ross, Stephen Scott, Michael Slote, Elizabeth Wik 
lott, and audiences at Cornell and the University of North Carolina/Chapel Hill 
for especially helpful comments. 

1 Further questions concern the relationship between rational choice and moral 
agency. Do we have reasons to be moral? If so, how do they relate to our reasons 
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I RATIONAL CHOICE 

This section distinguishes between optimizing and satisficing strate- 
gies, and between moderate and immoderate preferences. The fol- 
lowing three sections discuss, in turn, when satisficing strategies 
are rational, when they are not, and when cultivating moderate pref- 
erences is rational. Later sections offer a way of characterizing 
rational choice in situations where the agent’s alternatives are 
incommensurable. 

In the simplest context, one has a set of alternatives clearly ranked 
in terms of their utility as means to one’s ends. If one is an opti- 
mizer, one chooses an alternative that ranks at least as high as any 
other. In contrast, if one is a satisficer, one settles for any alterna- 
tive one considers satisfactory. In this static context, though, it is 
hard to see the point of choosing a suboptimal alternative, even if it 
is satisfactory. 

In a more dynamic and more typical context, we are not pre- 
sented with a set of nicely ranked alternatives. Instead, we have to 
look for them, judging their utility as we go. In this context, opti- 
mizing involves terminating a search for alternatives upon conclud- 
ing that one has identified the best available alternative. Although 
optimizing involves selecting what one judges is best, however, it 
need not involve judging what is best “all things considered,” be- 
cause sophisticated optimizers recognize that considering all alter- 
natives is not always worth the cost. There may be constraints (tem- 
poral, financial, and so on) on how much searching they can afford 
to do. A person who stops the search upon concluding that prolong- 
ing the search is not worth the cost is also employing an optimizing . 
strategy, albeit one of a more subtle variety.” 

Satisficing, in contrast, involves terminating the search for alter- 








to be rational? Although I will not address these questions in this paper, they are 
ultimately the Bee which concern me most and which motivate the larger 
project of which this paper is a part. 

? Thus, if options emerge serially, one could choose an option in preference to 
the alternative of searching for something better with po a of ever find- 
ing it. The difference between satisficing and this more subtle kind of optimizing 
has to do with what the two strategies take into account in reaching a stopping 
point. At any pomt in the search, we may let the expected utlity of stopping the 
search equal U; the utility of the best option we bave turned up 20 . The 
expected utility of continued search equals the probability of ing a better 
option, P(fbo), muluplied by the utility of finding a better option, U(/bo), minus 
the cost of further search, C(fs) At some point, the satisficer stops because he 
believes U is good eno In contrast, the subtle optimizer stops because she 
believes P(/bo) U(fbo)-C(/s) 1s less than zero Even if the two stopping rules happen 


to CORET on the same stopping point, they do so for different reasons and 
require different information 
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natives upon concluding that one has identified a satisfactory alter- 
native. What distinguishes satisficing from optimizing in the dy- 
namic context is that the two strategies employ different stopping 
rules.° Unlike the optimizer, who stops searching when she has ei- 
ther considered all her options or has run up against things like time 
constraints, the satisficer stops the search upon identifying an alter- 
native as good enough.‘ For example, suppose you enter a cafeteria 
seeking a nutrjtionally balanced and reasonably tasty meal. You walk 
down the cafeteria line surveying the alternatives. Lf you are satisfic- 
ing, you take the first meal that you deem nutritionally and aestheti- 
cally adequate. If you are optimizing, you continue down the line 
surveying alternatives until you reach the end of the line or run out 
of time. You then take the meal that you consider optimal, either in 
comparison to the other known options or in comparison to the. 
alternative of further search. A satisfactory meal may or may not be 
optimal. Likewise, as cafeteria patrons know only too well, the best 
available meal may or may not be satisfactory. Of course, if you 
switch from one stopping rule to the other, you might end up 
choosing the same meal, but you will be choosing it for a different 
reason. So, neither rule is reducible to the other. (One could employ 
both stopping rules simultaneously, of course, resolving to stop as 
soon as one finds a satisfactory alternative or runs out of time or has 
considered all available alternatives—whichever comes first.) Nor 
can satisficing be equated with the more subtle kind of optimizing 
that takes the cost of searching for more-than-satisfactory alterna- 
tives into account; satisficers select the satisfactory alternative be- 
cause it is satisfactory, not because they calculate that stopping the 
search at that point would maximize utility. 


3 I borrowed the term ‘satisficing’ from Michael Slote, Beyond Bajona Opening: A 
Study of Rational Choice (Cambridge: Harvard, 1989), p. 5, but my 
tion of satisficing differs from his. It turns out that ion ah Satish. 


as a sto Tule was anticipated by Herbert Simon ın Models of Thought 

ew Haven: Yale, 1979), p. 3. In the same passage, Simon also anticipates one of 
my main complaints about maximization strategies, namely, that they require in- 
formation that decision makers often do not have 

* I do not know of any precise way to characterize ‘good enough’. Options that 
promuse disease, impnsonment, or premature death are typically held in low es- 
teem, however, so the notion has certain objective elements. But what peo 
consider good en ough also seems relative to expectations. As expectations 
the standards by which an option is judged good enough also tend to rise. This 
fact can be tragic, for it can rob people of the ability to appreciate how well their 
See eee ee orun, it is rational to set goals with an 

€, 


eye to what is attainab eera Rise a es become attainable. 
But raising the standard by one deems one’s si on satisfactory is harder 


to fathom. Perhaps cen are baie rene incapable of aming at higher ae 
without simultaneously reformulating cre of what is satisfactory. I do 


not know. 
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With this characterization of satisficing in mind, we can now clar- 
ify the difference between satisficing and moderation. Consider that 
being moderate is contrasted, not with optimizing, but with being 
immoderate. The distinction between being moderate and being im- 
moderate is a distinction concerning how much one wants. It is a 
distinction between different ends rather than between different 
ways of pursuing them. Being a satisficer does not entail that the 
bundle of goods one would deem satisfactory is of moderate size. A 
person could be both a satisficer and an immoderate, for a given 
satisficer may have wildly immoderate ideas about what counts as 
satisfactory. (Consider a person whose goal in life is to be a million- 
aire by the age of thirty.) Likewise, that one is an optimizer does not 
entail that one is immoderate. A person could easily be both a mod- 
erate and an optimizer, for the bundle of goods required to maxi- 


mize her satisfaction may well be of moderate size. 
Il. WHEN SATISFICING IS RATIONAL 


There is an apparent incongruence between the theory and practice 
of rational choice. Theory models rational choice as optimizing 
(sometimes even “all things considered”’) choice, yet in practice satis- 
ficing is ubiquitous. We could explain the incongruence away by 
saying that when people think they are looking for something satis- 
factory, what they are really looking for is something optimal. But I 
think satisficing can be reconstructed as a subtle kind of optimizing 
strategy only on pain of attributing to people calculations they often 
do not perform (and do not have the information to perform) and 
intentions they often do not have. This section tries to explain satis- 
ficing in terms of thought processes that we can recognize in our- 
selves. Satisficing will emerge as a real alternative to optimizing, and 
thus as a strategy that can be evaluated, criticized, and sometimes 
redeemed as rational. 

We begin with the observation that people have a multiplicity of 
goals. For example, a person can desire to be healthy, to have a 
successful career, to be a good parent, and so forth. Some goals are 
broad and others narrow, relatively speaking. While a given pair of 
goals might have little to do with each other, it is also true that a 
given goal might be encompassed by another. In other words, the 
point of the narrower goal—the reason for its being a goal—is that 
it is part of what one does in pursuit of a broader goal. For example, 
I might want to upgrade my wardrobe because I care about my 
appearance because I want a promotion because I care about my 
career. Suppose I find that the multiplicity of goals that I pursue are 
ultimately encompassed and rationalized by a goal of making my life 
as a whole go well. To mark the difference in breadth between my 
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concern for my life as a whole and my concern for particular aspects 
of my life (such as my health or my career), let us say one seeks a 
local optimum when one seeks to make a certain aspect of one’s life 
go as well as possible. One seeks a global optimum when one seeks 
to make one’s life as a whole go as well as possible.’ 

Note that optima can be defined as such only within the context 
of the constraints under which goals are pursued. (Thus, when 
economists speak of maximizing utility, it goes without saying that 
they are talking about maximizing utility subject to a budget con- 
straint.) We pursue goals subject to the limits of our knowledge, 
time, energy, ability, income, and so on. More intriguing, however, 
is that we often make decisions under conditions where the set of 
constraints imposed on us by external circumstances is not deter- 
minate enough to yield a well-defined optimum. When this happens, 
we often impose further constraints on ourselves. 

For example, in fleshing out.the problem of locating a house, we 
have to make some prior decisions. We decide how long to look, how 
much money to spend, what neighborhoods to consider. We knock 
only on doors of houses displaying ‘‘for sale” signs rather than on 
every door in the neighborhood. To some extent, these constraints 
are imposed on us by mundane external factors, but they also have a 
striking normative aspect, for they are in part rules of conduct we 
impose on ourselves; we take it upon ourselves to make our con- 
straints more precise and more limiting so as to make our choice set 
more definite. Local optimizing would often be neurotic and even 
stupid if local goals were not pursued within compartments partly 
defined by self-imposed constraints. The constraints we impose on 
our narrower pursuits can keep narrower pursuits from ruining the 
larger plans of which they are part.® 

In other words, if we look at life as a whole, we can see that life as 
a whole will go better if we spend most of it pursuing goals that are 


5I borrowed the terms ‘local’ and ‘global’ from Jon Elster, although when 
rereading his text I noticed that the way he uses the terms bears little resemblance 
to the they are used here. He says the definitive difference between locally 
and glo maximizing machines is that the latter, unlike the former, are capable 
of wasting and tndtrect strategies See Elster, Ul and the Sirens: Studies m 

and Irrationality (New York: Camb , rev. 1984), p. 9 

° Jules Coleman has pomted out to me that what David Gauthier (Morals By 
Agreement (New York: Oxford, 1986), p. 170) calls constrained maximization 18 a 
particularly interesting kind of local optimizing under self-imposed constraints 
Constrained maximizers seek maximum payoffs in prisoner's dilemmas subject to 
this constraint: they will cooperate (and thus pass up the opportunity to unilater- 
ally defect) uf the expected payoff of cooperating is hi than the known payoff 
of mutual defection, which it will be if and only if they expect their partners to 
cooperate. 
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narrower than the goal of making life as a whole go better. That is 
why it is rational to formulate and pursue local goals. But it is also 
rational to prevent narrower pursuits from consuming more re- 
sources than is warranted by the importance (from the global per- 
spective) of achieving those narrower goals. Accordingly, we pursue 
narrower goals under self-imposed constraints. 

Although the constraints we impose on ourselves are imposed 
from a more encompassing perspective, it is only within the 
narrower perspective that we become subject to self-imposed con- 
straints. (External constraints, of course, are salient even at the 
global level.) Self-imposed constraints can be applied only to 
narrower pursuits and can be applied only from the perspective of a 
more encompassing pursuit. Our broader objectives constrain what 
we are willing to spend on upgrading a wardrobe. They constrain 
what we will do for the sake of our appearance, or for the sake of a 
promotion, or for the sake of a career. 

Armed with the distinction between local and global optimization, 
we can now explain when satisficing is rational. Michael Slote be- 
lieves the optimizing tendency can be self-defeating. He says that 
“[a] person bent on eking out the most good he can in any given 
situation will take pains and suffer anxieties that a more casual indi- 
vidual will avoid. . .” (op. cit., p. 40). And he asks us to consider 
“how much more planful and self-conscious the continual optimizer 
must be in comparison with the satisficer who does not always aim 
for the best and who sometimes rejects the best or better for the 
good enough” (op. cit., p. 40). In short, that one has an opportunity | 
to pursue the good is not by itself a compelling reason to pursue the 
good. Surely, Slote has an important point. Just as surely, however, 
his point applies to local optimizing rather than to optimizing as 
such. From the global perspective, seeking local optima can be a 
waste of time. Global optimizers seek local optima only when doing 
so serves their purposes. Thus, satisficing is a big part of a global 
optimizer’s daily routine. Effort can have diminishing returns, so a 
global optimizer will be careful not to try too hard. Local optimizing 
often gives way to satisficing for the sake of global optimality. 

From the global optimizer’s point of view, the process of buying a 
house provides a good example of how satisficing can be rational. 
When we choose a house, we might proceed by seeking the best 
available house within certain constraints—within a one-month time 
limit, for example. We impose such a limit because we have goals 
other than living in a nice house. Looking for a house competes with 
our other goals for our time and energy. Or we might look for a 
satisfactory house and cease looking when we find one. Most of the 
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people I have asked say they would optimize within constraints, but 
would not deem satisficing irrational. Like local optimizing, it can 
serve our larger plans by setting limits on how much effort we put 
into seeking a house at the expense of other goals that become more 
important at some point, given the diminishing returns of remaining 
on the housing market. An optimizing strategy places limits on how 
much we are willing to invest in seeking alternatives. A satisficing 
strategy places limits on how much we insist on finding before we 
quit that search and turn our attention to other matters. 

The two strategies need not be inflexible. People sometimes have 
reason to switch or revise strategies as new information comes in. If 
I seek a satisfactory house in an unfamiliar neighborhood and am 
shocked to find one within five minutes, I may stop the search, 
acknowledging the stopping rule I previously imposed on that activ- 
ity. On the other hand, I may conclude that, since I formulated my 
aspiration level under unrealistically pessimistic assumptions, I 
should resume my search with a satisficing strategy revised to reflect 
a higher aspiration level. Or I may switch to a local optimizing strat- 
egy, spending another day or two looking at houses, and then taking 
the best I have found so far. Or I may do both, looking until I either 
reach my new aspiration level or reach my time limit. In this way, the 
two strategies are often interactive. 

Likewise, suppose I started out with the idea of seeking the best 
house I could find within a one-month time limit but have so far 
been terribly disappointed with my options. In this case, when after 
two weeks I finally find a house that meets my plummeting aspira- 
tion level, I may find myself embracing that sadder but wiser aspira- 
tion level as a stopping rule, abandoning my original plan to seek a 
local optimum with respect to a one-month time constraint. 

I suspect that the more concrete our local goals are, the more 
reason there is to satisfice. If we do not know exactly what we are 
looking for, then we usually are better off setting a time limit and 
then taking what we like best within that limit. But if we know exactly 
what we are looking for, then it is rational to stop searching as soon 
as we find something that fits the bill.” So, having detailed informa- 
tion about our goals weighs in favor of using that information in 
formulating aspiration levels as stopping rules. Conversely, the more 
we know about our set of alternatives, the easier it is to identify 


day Rosen tells me that before he began looking for a house, he made a 
list of desirable features, telling himself he would take the first house that had 
85% or more of those features. As it happens, the first house he looked at scored 
85% He stopped looking, bought the house, and has lived there ever since 
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which alternative has the highest utility, which weighs in favor of 
seeking local optima. 

The stakes involved are also pertinent—indeed crucial. The less 
we care about the gap between satisfactory and optimal toothpaste, 
for example, the more reason we have to satisfice—to look for a 
satisfactory brand and stop searching when we find it. Note the 
alternative: instead of satisficing, we could optimize by searching 
among different brands of toothpaste until we find the precise point 
at which further search is not worth its cost. But an optimal stopping 
point is itself something for which we would have to search, and 
locating it might require information (about the probability of find- 
ing a better brand of toothpaste, for example) that is not worth 
gathering, given the stakes involved in the original search for tooth- 
paste. Against this, one might object to my assumption that we need 
precision in the search for an optimal stopping point. Why not seek 
to learn roughly when looking for better toothpaste is not worth the 
cost? In the search for a stopping point that we might graft onto the 
original search for toothpaste, it can be more rational to seek to be 
tolerably close to an optimum than to seek to be at an optimum. 

But that is my point: there are cases where one does not care 
enough about the gap between the satisfactory and the optimal to 
make it rational to search for the optimal. Searching for optimal 
toothpaste can be a waste of time, but so can searching for the 
optimal moment to quit looking for toothpaste. One way or an- 
other, satisficing enters the picture. There will be times when even 
the most sophisticated optimizing strategies will be inappropriate, 
for they require information that we may not have and whose acqui- 
sition may not be worth the trouble. And a less sophisticated ‘‘all 
things considered” strategy will nearly always be inappropriate. Ra- 
tional choice involves considering only those things which seem 
worthy of consideration, which is to say it involves satisficing, i.e., 
having a stopping rule that limits how comprehensive a body of 
information we insist on gathering before stopping the search and 
turning our attention to other matters. 

There is also something to be said for having a moderate disposi- 
tion—a disposition that allows one to be content with merely satis- 
factory states of affairs. Consider that starting a search too soon can 
be every bit as wasteful as stopping a search too late. Searching for a 
house is costly. It is costly partly because people have other goals; 
the time and energy you spend searching for a house could have 
been spent on other things. Even if you find a better house than you 
already have, the process of moving will also be costly. Moreover, it 
takes time living in and enjoying a house in order to recoup these 
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costs. If you move every month, you will always be paying the costs 
and never enjoying the benefits of better housing. Moving into a 
house is part and parcel of a decision to stay a while, for it is only in 
staying that you collect on the investment of time and energy you 
made in moving. The general lesson is that costly transitions to pre- 
ferred states of affairs require intervening periods of stability so that 
transition costs can be recovered and thus rationally justified. The 
stability of the intervening periods requires a disposition to be con- 
tent for a while with what one has—to find something one likes and 
then stop searching. 

Further, even if transition costs are relatively minor, there still can 
come a point when we should abandon the search for, say, a better 
job or a better spouse, not because such goals are unattainable or 
even because the transition costs are too high, but rather because 
such goals eventually can become inappropriate. At some point, we 
have to start collecting the rewards that come only when we make a 
genuine commitment—when we stop looking for something or 
someone better. Local optimization processes have their own ends, 
but initially adopting an end and creating a compartment within 
which to pursue it is itself a goal-directed activity and, from the 
standpoint of the global optimizer, not one to be engaged in frivo- 
lously. We need to be able to satisfice within various local compart- 
ments (those defining our searches for spouses, jobs, and so on) in 
order to be able to make our lives as a whole go as well as possible. 

II. WHEN SATISFICING IS NOT RATIONAL 

Slote says that “choosing what is best for oneself may well be neither 
a necessary nor a sufficient condition of acting rationally, even in 
situations where only the agent’s good is at stake” (op. cit., p. 1). 
For example, a person who is moving and must sell his house might 
seek, “not to maximize his profit on the house, not to get the best 
price for it he is likely to receive within some appropriate time pe- 
riod, but simply to obtain what he takes to be a good or satisfactory 
price” (op. cit., p. 9). Once the seller receives a suitable offer, he 
may rationally accept it immediately, even though there would be no 
cost or risk in waiting a few days to see if a higher offer materializes. 
“His early agreement may not be due to undue anxiety about the 
firmness of the buyer’s offer, or to a feeling that monetary transac- 
tions are unpleasant and to be got over as quickly as possible. He 
may simply be satisficing in the strong sense of the term. He may be 
moderate or modest in what he wants or needs” (op. cit., p. 18). 

Slote does not offer an analysis of rationality. Nor shall I. I do, 
however, offer this as a necessary condition of rationality: one’s 
choice is rational only if one did not recognize clearly better reasons 
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for choosing any of one’s foregone alternatives. This condition does 
not beg any of the questions that concern us here. It does not entail 
that rational choice is optimizing choice. Rather, it allows that one 
could rationally choose an alternative because it is satisfactory, ter- 
minating the search of one’s choice set at that point. Moreover, it 
also allows that, if one has two satisfactory alternatives, one could 
choose the more moderate of the two on the grounds that it satisfies 
a preference one happens to have for moderation. 

On the other hand, although a suboptimal option may be good 
enough to be worthy of choice in a given case, that does not mean it 
is worthy of being chosen in preference to something better. If one 
has two choices and one alternative is satisfactory but the other is 
not, then the satisfactory choice is rational because it is better. But 
suppose one has two choices and both are satisfactory. (E.g., sup- 
pose your house is for sale, and you simultaneously get two satisfac- 
tory offers, one for $200,000 and another for $210,000, and you 
prefer the larger offer.) In this case, one cannot rationalize choosing 
the inferior alternative merely by pointing out that the inferior al- 
ternative is satisfactory. The inferior option is satisfactory, but since 
this is not a difference, it cannot make a difference either. By hy- 
pothesis, the superior option is also satisfactory. 

Why, then, should we choose the superior option? Presumably 
because it is better. Whatever it is in virtue of which we deem that 
option superior is also a reason for us to choose it.? Oddly, Slote 
denies this. It can be rational to choose the inferior option, Slote 
insists. Nor do we need a reason to choose the inferior option, Slote 
argues, because rationality does not always require people to have a 


8 That is, an opumizer might choose a peeves option in preference to 
searchmg for better options that might never mat . Slote, however, says we 
intuitively recognize the rationality of taking the first satisfactory offer even in 
abstraction from the real-world risks and anxieties of having to sell one’s house 
(op. at, p 18). But my view is that, if we are going to talk about common sense 

owing the seller to accept the firm offer immediately even though the seller has 
the option of waiting a few Saywan hope ob «ues er, then we have to stick to 
the conditions under which common sense holds sway. We do indeed have intu- 
itions about what we should do m risky situations, but we cannot, as Slote wants to 
do, simply stspulate that our intuitions sg ta risky situations have nothing to 
do with the fact tbat in the real world such situations are risky In the real-world 
ho market, to turn down an entirely satisfactory offer m quest of something 
better 1s to court disaster, to so arm as it were This is one reason why it is 
common sense, and rationally exp le common sense, for a global optimizer ta 
be hesitant about ing down a satisfactory offer. Even from a local ive, 
the expected gain from search may not be worth risking the potential loes. 

° Philip Pettit makes the same point in “‘Satisficing Consequentialism,” Proceed- 
ings of the Aristotelian Society, supplement to LVIN (1984): 165-76, here p. 172 
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reason for choosing one alternative rather than another (op. cit., p. 
21). For example, a rational person might grab a blouse out of her 
closet in the morning without being able to explain why she chose 
that one over the similar blouses hanging beside it. To call her irra- 
tional simply because she cannot explain her choice would be a 
mistake. 

I agree with this, as far as it goes; not all choices have to be or can 
be explained. To deem a choice rational, however, is to imply there 
is an explanation of a certain kind. A person can be rational without 
being aware of reasons for everything she does, but the things she 
does for no reason are not rational, and we do not show them to be 
rational merely by pointing out that they were done by a rational 
person. The person who simply grabs a blouse may be choosing, 
perhaps rationally, to forgo the opportunity to choose rationally 
which of her several blouses she ends up wearing. If she is running 
late for the train, then under the circumstances anything that counts 
as a blouse will also count as satisfactory, so she leaves to impulse the 
selection from her set of blouses. (In this case, the process of 
searching among alternative blouses virtually vanishes—there is 
hardly any choice at all. If she instead gives herself a few seconds to 
make sure she avoids blouses with valentines or political slogans on 
them, then she will be choosing within a small but real local com- 
partment.) 

Thus, it can be rational to forgo the opportunity to choose ratio- 
nally, because searching for reasons to choose one alternative over 
another might be a waste of time from a broader perspective. Never- 
theless, it definitely is a condition of rational choice that the choice 
in question is made for a reason. Deliberately choosing the worse 
over the better is irrational, and this is not mitigated by the fact that 
rational people sometimes leave their choices to impulse. 

Rational-choice theory can tell us a story about why the commuter 
finds herself going to work in a green blouse with orange polka dots, 
but the story will require an implicit or explicit distinction between 
more and less encompassing perspectives. Without the distinction, 
an optimization story would be blatantly false, for she does not in 
fact choose the optimal blouse, and a satisficing story would have 
neither explanatory nor justificatory power, for the point of choos- 
ing a merely satisfactory blouse when better ones were available 
would remain a mystery. To be able to see the point of what she does 
at the local level, we have to be able to step back and look at her 
actions from a broader perspective. From a broader perspective, it 
may be perfectly reasonable for her to cut short a particular search 
because she was looking for something satisfactory and found it, but 
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it cannot be rational to choose something because it is satisfactory 


while at the same time recognizing a clearly better alternative. 
IV. WHEN MODERATION IS RATIONAL 


As I said earlier, we cannot choose between two satisfactory alterna- 
tives on the grounds that one of them is satisfactory. We could, 
however, choose on the grounds that one of them is more moderate. 
Consider an example of Slote’s. He says it “makes sense” for some- 
one to desire “to be a really fine lawyer like her mother, but not 
desire to be as good a lawyer as she can possibly be. This limitation 
of ambition or aspiration may not stem from a belief that too much 
devotion to the law would damage other, more important parts of 
one’s life. In certain moderate individuals, there are limits to aspira- 
tion and desire that cannot be explained in optimizing terms. . .” 
(op. cit., p. 2). 

Common sense indeed recognizes that moderate aspirations can 
be rational, but to note this fact in an off-the-cuff way is hardly to 
provide an explanation of moderate aspirations. Our common-sense 
recognition is precisely what has to be explained. If all we have is an 
intuition that an act makes sense, but cannot say what the act makes 
sense in terms of, then we would be jumping to conclusions if we 
said we were approving of the act as rational. In contrast, if we 
explain a show of moderation in terms of its conduciveness to over- 
all satisfaction, then we have explained it as rational. We have not 
merely claimed that it makes sense; rather, we have actually made 
sense of it. We have shown that we had-reason to choose as we did, 
while not having better reasons to choose differently. 

How, then, might we explain having moderate career goals? First, 
there is the issue of trade-offs mentioned by Slote. One might culti- 
vate an ability to be content with moderate career goals, not because 
one prefers moderate success to great success, but because one 
cares about things other than success. Thus, one point of cultivating 
modest desires with respect to wealth is that it might improve a 
person’s ability to adhere to a satisficing strategy with respect to 
income, thus freeing herself to devote time to her children, her 
health, and so on. 

There are also ways in which moderation can have instrumental 
value that do not depend on the need to make trade-offs. There can 
be reasons for striving to be as good a lawyer as one’s mother even if 
one wants to be as good a lawyer as possible. For example, a person 
might aim at being as good as her mother as a stepping stone to 
becoming the best lawyer she can be. The modesty that enables a 
person to concentrate on successfully making smaller steps may 
eventually put her within reach of something more lofty. There is 
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also value in concreteness. A person may have no idea how to go 
about becoming the best possible lawyer, but may have a much 
clearer idea about how to become as good as her mother because 
the more modest goal is more concrete. Further, even given two 
equally concrete goals, an optimizer might very well choose the 
lesser on the grounds that only the lesser goal is realistic. Thus, one 
might become a better lawyer by emulating one’s highly competent 
mother than by wasting one’s time in a fruitless attempt to emulate 
her superstar partner. 

Finally, we can at least conceive of moderation being a preference 
in itself—not just a quality of a desire but itself the thing desired.!° 
One might explain the cultivation of such a preference on the 
grounds that moderation is less distracting than extravagance, with 
the consequence that the moderate life is the more satisfyingly 
thoughtful and introspective life. In various ways, then, moderation 
can have instrumental or even constitutive value from the global 
perspective. Insofar as moderate preferences can be deliberately 
cultivated, their cultivation is subject to rational critique, and can 
thus be defended as rational. 

V GLOBAL PERSPECTIVE AND UNDERDETERMINED RATIONAL CHOICE 
Like moderate preferences, satisficing strategies can be of instru- 
mental value from the global perspective. Satisficing strategies, how- 
ever, can be of instrumental value only. This is because to satisfice is 
to give up the possibility of attaining a preferable outcome, and 
giving this up has to be explained in terms of the strategic reasons 
one has for giving it up. Local optimizing must likewise be ex- 
plained, for it too consists of giving something up, namely, the op- 
portunity to invest one’s efforts in some other compartment. 

Global optimizing, however, is not open to question and subject 
to trade-offs in the ways that local optimizing and satisficing are. 
Local goals can compete with each other, but there are no goals that 
compete with optimizing at the global level, at least not in the arena 
of rationality. A global optimum is not one among several compet- 
ing goals; rather, in encompassing our lives as a whole, it also en- 
compasses our competing goals. It represents the best way to resolve 
the competition from the standpoint of life as a whole. Local opti- 
mizing can be a waste of time from the global perspective, but global 
optimizing cannot. 

What, then, is the nature of the global perspective? Do we ever 
actually assume the global viewpoint or is this merely a theoretical 
postulate? The answer is that we can and do assume the global view- 


1° I thank Mark Ravizza for this point. 
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point every time we do what we call “stepping back to look at the big 
picture.” We do sometimes ask ourselves if the things we do to 
advance our careers, for example, are really worth doing. We do not 
spend all our waking hours looking at the big picture, of course. Nor 
should we, for when we look at the big picture, one thing we see is 
that we can spend too much time looking at the big picture. Reflec- 
tion is a crucial part of the good life, but it is only a part. Part of 
attaining a global optimum involves being able to lose ourselves for 
a time in our local pursuits. 

To speak of attaining a global optimum, however, and to endorse 
global optimizing as a normative ideal, is not to presume that there 
is an algorithm for identifying global optima. In fact, not every set of 
alternatives even contains a well-defined optimal choice, let alone 
one that can be easily identified. To borrow a fanciful example from 
John Pollock," suppose you are immortal, and are also fortunate to 
have in your possession a bottle of ‘‘EverBetter Wine.” This wine 
improves with age. In fact, it improves so steadily and so rapidly that 
no matter how long you wait before drinking it, you would be better 
off, all things considered, waiting one more day. The question is: 
When should you drink the wine? 

A rational person presumably would simply drink the wine at 
some point (perhaps after artificially constraining himself to drink 
the wine by year’s end, and then picking New Year’s Eve as the 
obvious choice within that time frame), but the person would not be 
able to defend any particular day as an optimal choice. Indeed, it is 
built into the example that no matter what day the immortal 
chooses, waiting one more day would have been better. There are 
no constraints with respect to which he can regard any particular 
day as the optimal choice, unless he imposes those constraints on 
himself. 


There is something about choosing New Year’s Eve that is rational, 
but it consists of something other than how that day compares to the 
alternatives. Although the immortal could not defend choosing New 
Year’s Eve in preference to waiting one more day, the choice is 
defensible in the sense that he did not have a better alternative to 
picking something or other. Indeed, picking something or other was 
the optimal selection from the set of options whose only other 
member consisted of sitting on the fence forever. The distinction 
between local and global optimizing thus allows us to explain with- 
out paradox the sense in which choosing New Year's Eve was 


11 “How Do You Maximize Expectation Value?” Notis, xvm (1984): 409-21, 
here pp. 417ff. 
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in our lives only when they develop a certain autonomy, when we 
pursue them not as means of making our lives go well but as ends in 
themselves. We begin to tap the capacity of our ideals, our spouses, 
and our children to enrich our lives only when we acknowledge 
them as having value far beyond their capacity to enrich our lives. 
(Cherishing them becomes more than an instrumental means of 
making life go well; it becomes constitutive of life going well.) And 
goals that we come to cherish as ends in themselves inherently tend 
to become incommensurable with each other. We may, for instance, 
find ourselves in a position where we cannot fight for a cause in 
which we deeply believe without compromising the care which our 
loved ones need from us and which we wholeheartedly want them to 
have. Nevertheless, this is the price of the richness and complexity 
of a life well-lived. To have both ideals and loved ones is to run the 
risk of having to make decisions under unresolved conflicts of value. 

Because some of our values are incommensurable, we sometimes 
have no method by which to identify optimal trade-offs among con- 
flicting local goals. In such cases, the goal of making life as a whole 
go as well as possible remains meaningful, although there may not 
be any course of action that unequivocally counts as pursuing it. 
Even if would-be global optimizers cannot identify optimal options, 
they can still reject alternatives that fail to further any of their goals. 
In particular, if no better way of resolving the conflict emerges, 
simply picking something or other will emerge as optimal compared 
to the alternative of remaining on the fence, for we eventually reject 
fence sitting on the grounds that it fails to further any of our goals. 

This may seem a grim picture of rational choice at the global level, 
but there are two points to keep in mind. First, when faced with a 
situation in which we must simply pick something, we are likely to 
have regrets about paths not taken, but we naturally adapt to the 
paths we take, and regret can fade as we grow into our choice. Thus, 
an alternative somewhat arbitrarily picked from a set within which 
no optimum exists can eventually come to be viewed as optimal from 
the perspectives of people we are yet to become, even if it could not 
be considered optimal at the moment of choice. Second, this discus- 
sion of underdetermined rational choice concerns a worst-case sce- 
nario. Global optimizers carry out the highest-ranked life plan when 
they have one. Often, however, there is no highest-ranked plan for 
life as a whole and thus no well-defined global optimum; there is 
only a need to cope with competing and sometimes incommensur- 
able local goals. In the worst case, no course of action unambigu- 
ously qualifies as making life as a whole go as well as possible, except 
insofar as it is unambiguously better to move in some direction 
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rather than none. But this gives us enough to avoid paralysis even in 
the worst case. By hypothesis, simply picking something emerges as 
the best the agent can do, and thus to pick something is to optimize 
with respect to the choice of whether to spend more time sitting on 
the fence. 

It would be natural to say rational choice is choice “all things 
considered.” The trouble is that we often find ourselves not know- 
ing what to consider, and it would be bad advice to tell us to con- 
sider all things. We can consider all things within a limited range, 
perhaps, but the limits of that range will themselves tend to .be 
matters of choice in large part. We start out knowing that in some 
sense we want each aspect of our lives to go as well as possible, yet 
Wwe realize that our resources are limited and that our various pur- 
suits must make room for each other. When looking at our lives as a 
whole, what is most clear is that rationally managing a whole life 
involves managing trade-offs among life’s various activities. If the 
benefits that will accrue from our various pursuits are known and 
commensurable, then managing the trade-offs is easy, at least theo- 
retically; we simply maximize the sum of net benefits. In many of the 
everyday cases I have been discussing, however, the benefits are 
neither known nor commensurable with other benefits. Even so, we 
can effectively manage trade-offs among our particular pursuits by 
setting limits on how much of our lives we will spend on particular 
pursuits. We can also set limits on how much benefit we insist on 
getting from particular pursuits. To impose the latter kind of stop- 
ping rule on a particular pursuit is to embrace what I have called a 
satisficing strategy. 

Both kinds of constraint play a role in rational choice. Why? Be- 
cause if we recognized only temporal limits, say, then we would 
automatically spend our full allotment of time in a given compart- 
ment even when we had already found an acceptable option. But if 
we also have strategically limited aspiration within that compart- 
ment, then finding an acceptable option will trigger a second kind of 
stopping rule. The second stopping rule closes the compartment 
and diverts the unused portion of the compartment’s time allotment 
to other compartments where our need to find an acceptable option 
has not yet been met. Cultivating moderate preferences may also be 
advantageous in a supplementary way insofar as moderate prefer- 
ences may help us adhere to the kind of limit we impose on a pursuit 
when we embrace a satisficing strategy. 

Against the idea that our most important goals tend to become 
incommensurable with each other, one might suppose our global 
end is simply to flourish or to be happy—and that our local goals 
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therefore must be commensurable in such terms. This would be a 
tidy climax to an otherwise rather untidy story about rational choice 
under unresolved conflict of values, but the tidiness would be super- 
ficial. One hardly gives people an algorithm for resolving conflicts 
when one advises them to be happy. What makes such advice vacu- 
ous, I think, is that flourishing and being happy cannot be concrete 
goals at the global level in the way that finding a house can be at the 
local level. Of course we want to flourish, but we aim to flourish 
only in an especially metaphorical sense. The fact is that we flourish 
not by aiming at flourishing but by successfully pursuing things 
other than flourishing, things worth pursuing for their own sake. 

Likewise, happiness can be a standard by which a life as a whole is 
judged, perhaps, but it cannot be a goal at which a life as a whole is 
aimed. We do not become happy by pursuing things there would 
otherwise be no point in pursuing. Rather, there must be a point in 
striving for a certain goal before striving for it can come to have any 
potential to make us happy. To aim at happiness is to aim at a 
property that can emerge only in the course of aiming at something 
else.!® So, the point about happiness and flourishing leaves us where 
we started, having to choose among things we value for their own 
sake, hoping we will be happy with our choice. 

I might add that happiness derives from a variety of local sources, 
and the different elements of a person’s happiness are not inter- 
changeable. We can find happiness in our careers or in our 
marriages, but the vacuum left by a shattered career cannot be filled 
by domestic bliss. Our various local pleasures are not fungible. They 
do not admit of an overall measure suitable for use at the glo- 
bal level. 

VI. THE BROADEST PERSPECTIVE 

The global perspective is the perspective that encompasses our lives 
as a whole. Decision making at this level disciplines the amount of 
time we devote to particular local compartments. I think we are 
capable of taking a perspective this broad even in worst-case sce- 
narios where there is no well-defined global optimum. But even if 
we suppose that we can take a perspective encompassing our whole 
lives, why should we suppose that this is the broadest perspective we 
can take? 


18 As Bernard Williams puts it, one has to want other things for there to be 
anywhere that happiness can come from. See “A Cntique of Utilitarianism,” in 
d: C. Smart and Bernard Wiliams, Uttlitananism: For and Against (New 

ork Cambridge, 1973), p. 113. 

16 I thank Nick Sturgeon for a discussion from which this point emerged 
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Perhaps there can be broader perspectives than what I have called 
the global perspective. Indeed, in the larger project of which this 
paper is a part, I hope to show that we do have access to a larger 
perspective, that there are some aspects of morality that we cannot 
appreciate except from this larger perspective, and that it can be 
rational to try to achieve this perspective. On the other hand, I think 
it would be highly unrealistic to suppose there is an infinite regress 
of levels. There is no need to prove that an infinite regress is impossi- 
ble, but I would like to be able to show that my theory does not 
presuppose an infinite regress. 

The threat of infinite regress arises in the following way. I said we 
cannot spend all our time looking at life as a whole; we must be able 
to lose ourselves (or perhaps I should say, find ourselves) in our 
local pursuits. How much time, then, should we spend pondering 
conflicting values? How much time should we spend looking at life 
as a whole? From what perspective do we decide what constraints to 
apply to the amount of time we spend looking at our lives from the 
global perspective? Perhaps we need a “‘super-global”’ perspective in 
order to answer these questions. After all, how could we decide how 
much time to spend at a given level unless we did so from a still 
more encompassing perspective? It seems that my theory can ex- 
plain the time we allot to a given perspective only by supposing that 
we retreat to a broader one, ad infinitum. 

But the theory presumes no such retreat. There are simpler, more 
realistic ways to explain the amount of time we spend looking at life 
as a whole.” First, there are things, like sleeping, that we do as the 
need arises; since we do not decide how much time to spend sleep- 
ing, we do not decide from a broader perspective, either. Indeed, 
we might be better off sleeping as we feel the need rather than 
trying to set aside a calculated amount of time for sleep. Perhaps the 
same holds true of the activity of looking at life as a whole. Insofar 
as our purpose in looking at life as a whole is to resolve conflicts 
arising between various aspects of our lives, so that life as a whole 
may go as well as possible, there will come a time when taking a 
global perspective has served its purpose. At that time, the compart- 
ment reserved for the activity of resolving local conflicts naturally 

closes until subsequent conflict forces it open again. There is no 


1 The simplest way to explain the amount of time spent at the global level 
would be to say that we take whatever time we need to consider everything. The 
trouble is that we do not have ome to consider everything that might be relevant 
to life as a whole, any more than we have time to consider everything that might 
be relevant to the ing of a house. The explanation will have to be more 
complicated than , hence the line of thought pursued in the following text. 
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residual conflict awaiting resolution at a higher level. Thus, the ques- 
tion of how much time to spend in contemplation need not itself 
require contemplation. Rather, we take whatever time it takes to 
resolve a conflict, or else we reach a point where we must simply 
pick something. More generally, we stop contemplating when we 
judge that pursuing our local goals has come into conflict with— 
and has become more important than—the activity of thinking 
about how to juggle them. (For example, we would not dwell on the 
big picture if we were starving. Conflicts are rarely so important that 
contemplating them could pre-empt securing our immediate sur- 
vival.) In this scenario, we are driven to the global level by local 
conflict and eventually are driven from that level by a need to get on 
with our lives.’® 

But we can also imagine a second kind of scenario in which the 
amount of time we spend looking at life as a whole is determined by 
deliberate calculation, in the same way that we could imagine delib- 
erately calculating how much time to spend sleeping. Could we 
make a conscious decision of this kind without taking a super-global 
perspective? Yes, we could. Consider that contemplation is an activ- 
ity that must find its place in our lives along with other activities. For 
example, I might spend the month of June in a rented cabin, not 
doing anything to pursue my career, but just thinking about why I 
ever wanted to be a philosopher and about whether my original 
reasons still hold. This compartment in my life is reserved for con- 
templating my career. It is separate from the compartment or com- 
partments within which I actually pursue my career. I also have a 
compartment, similar in many respects, within which I contemplate 
life as a whole. But although the subject I contemplate is the whole 
of my life, the contemplation itself is not. The contemplation is only 
one of many activities about which I care. 

Now, if I need to decide how much time to reserve for contem- 
plating life as a whole, I take a global perspective, trying to gauge 
how important that activity is to my life as a whole. Notice, then, 
what is unique about the compartment I reserve for the activity of 
contemplating my life as a whole. The compartment is unique be- 
cause its boundaries are set by the activity that takes place within it. 
In the course of contemplating life from the global perspective, I 


18 We also can be driven to a global perspective by the resolution of conflicts 
Thus, when we finish a major project that had forced other pursuits to take a back 
seat, we often take time to evaluate self-imposed constraints and decide how to 
divide our extra time ena? previously neglected projects. And what drives us 
from the global perspective Is the eventual resolution of a local conflict between 
savoring big picture (a satisfying activity indeed when just finishing a major 
project) and the need, say, to start making dinner 
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decide how much time to reserve for any given activity, including 
contemplation in general and contemplating life as a whole in partic- 
ular. In this scenario, like the previous one, no boundary-setting 
issue is left to await resolution at a higher level. 

I have outlined two possibilities here. In one case, I use whatever 
time it takes to resolve conflicts, subject to pre-emption by activities 
that in the short run are more important than conflict resolution. In 
this case, no decision is required. The discipline is automatic. In the 
second case, I discipline the compartment from within, as the pro- 
cess of contemplating trade-offs culminates in time being reserved 
for contemplation along with other local activities. Therefore, I do 
not need a super-global perspective to decide how much time to 
reserve for the activity of taking the global perspective. Such deci- 
sions are precisely the kind I make from the global perspective itself, 
if I need to make them at all. Unless we introduce something that 
competes with the goal of making life as a whole go as well as possi- 
ble (such as, perhaps, the recognition of moral obligations), there is 
no reason to step back from a global perspective to something even 


broader. 
VII. CONCLUSIONS 


This paper sets out a normative ideal of rational choice that is suit- 
able for the kind of beings we happen to be, beings who would only 
hurt ourselves if we tried to maximize our overall utility in every 
waking activity. It defines satisficing and local optimizing as strate- 
gies for pursuing goals within constraints that are in part self- 
imposed. Satisficing emerges not as an alternative to optimizing as a 
model of rationality, but rather as an alternative to local optimizing 
as a strategy for pursuing global optima. 

Under normal conditions, we employ a combination of heuristics, 
such as (1) compartmentalizing our pursuits so as to narrow the 
scope of any particular optimization problem to the point where our 
limited knowledge becomes sufficient to identify an optimal solu- 
tion, (2) accepting self-imposed constraints for the same reason as 
well as to keep particular pursuits from pre-empting more impor- 
tant ones, and (3) satisficing, which has the effect of closing com- 
partments as soon as they serve the purpose for which they were 
created. Under normal conditions, where we lack the information 
we need to assign probabilities and utilities, this combination of 
strategies is more effective at making our lives as a whole go well 
than the alternative of plugging guesswork into a formula for maxi- 
mizing expected utility. Thus, it is no wonder we so rarely make any 
attempt to calculate expected utilities, for the truth is that we 
usually have better things to do. 
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When goals are in conflict, there may not be any well-defined 
sense in which one way of resolving the conflict is better than the 
alternatives from the viewpoint of one’s life as a whole. Of course, 
we do well to cultivate moderate preferences so as to reduce the 
frequency and severity of conflicts of value. But at the same time, 
there are limits to what one should do to avoid situations of under- 
determined choice, for the risk of finding oneself in such situations 
is a risk we assume in the process of becoming rationally committed 
to particular ends as ends in themselves. A life without regrets (i.e., 
regrets about decisions made under unresolved conflict) is prefera- 
ble all other things equal, but if the lack of regret is purchased at a 
cost of not having goals that can come into unresolvable conflict, the 
price is too high. To adopt a number of goals as ends in themselves 
is to risk finding oneself in situations where global optima do 
not exist, but there are reasons why a global optimizer would take 
that risk. 

I admit that the theory is not particularly neat and tidy, certainly 
not in comparison to the standard model. But tidying up the theory 
at the expense of realism would be a mistake, for the theory is meant 
to be about us, not about mathematically tractable caricatures of us. 
I believe rational-choice theory developed along the lines indicated 
here has more power than standard maximization models to explain 
the ways in which we actually live, but it does not thereby become 
merely a self-congratulating description of how we live. Rather, it 
remains (or becomes) a tool for evaluating and criticizing the ways in 
which we actually live. It sets out a normative ideal of rational choice 
that it would be natural and healthy for us to try to live up to. 

DAVID SCHMIDTZ 
Yale University 
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RIGHTS BETWEEN FRIENDS* 


re there rights between friends? The very question will 

sound odd to some, even perverse to others. Some might 

insist that friendship is one of the few human relationships 
that can be carried on successfully without any recourse to rights. 
People only have rights against those who are indifferent, or actively 
antagonistic, to their legitimate claims. Between true friends rights 
are simply useless, or possibly their use is even offensive. Even more 
strongly, others might contend that friendships should be carried on 
without the admixture of the defensive, aggressive, and individualis- 
tic notion of rights. Indeed, perbaps introducing the idea of rights 
(or allied concepts like claims, obligations, or protection) into an 
account of friendship will inevitably undermine the very notion of 
friendship. Along this line, some might also assert that in practice 
the self-conscious possession of rights will surely lead to the demise 
of any particular friendship. 

This paper will address these questions: Are there rights between 
friends or does the adversarial nature of rights inevitably frustrate 
the truly mutual orientation of friendship? If the case can be made 
that there are rights between friends, just what kinds of claims might 
friends make against one another? To assume simply that there is no 
possible legitimate employment of rights between friends is to en- 
tertain some common, and I shall argue misguided, presumptions 
about both rights and friends. For instance, one must not regard it 
as a given that the existence of a friendship directly results in a 
flawless cooperative partnership—a kind of ‘‘preestablished har- 
mony” between friends. Equally, one must not presuppose that the 
activity of claiming rights is not liable to any limitations. Such pre- 
sumptions lead to a less than sophisticated view of such rich and 
complex notions as rights and friendship. ; 

The central question here—Are there rights between friends?— 
can also be seen as one form of a slightly broader question: Are 
there rights within close-knit communities? Part of an increasingly 
popular “communitarian” critique of a rights-based, “‘liberal’” tra- 
dition in political theory is that in certain kinds of communities— 


* A version of this paper was read at the Seventh Annual Meeting of the North 
American Society for Social Philosophy and the 1992 Pacific Divison Meeting of 
the American Philosophical Association; I would like to thank discussants there 
for comments. I also Neera K. Badhwar, Marcia Baron, Don Beggs, Martin 
Cook, b Felt, Martin Gunderson, Philp Kain, Elizabeth Radcliffe, and Dan 

e, for helpful comments on earlier drafts of this paper. 
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especially close-knit ones like friendships where shared purposes 
and mutual togetherness are the rule not the exception—rights (and 
kindred notions like justice) are not at all valuable.! Indeed, having 
rights may even be harmful to such communities because they inevi- 
tably promote noncommunitarian values like egoism and individual- 
ism, and perpetuate antagonistic attitudes where they have no place. 
I do not plan to address directly the somewhat larger question—Are 
there rights in all types of communities? But given the modest as- 
sumption that friendship is a type of community (explained and 
defended later), some gain in thinking about political theory may 
follow, if we conclude that it is sensible to think that persons have 
rights within the close-knit community that arises between friends. 
At least it may in this way be easier to see how having, and at times 
exercising, rights is not antithetical to the values and virtues of even 


the very closest of communities. 
I 


It seems useful to start by stipulating some general working defini- 
tions of the central concepts in question. For the purposes of the 
following discussion: (1) “Tad has a right” if and only if “Tad has a 
claim protected by (or correlated with) an obligation”; (2) “Tad has 
a right against Tim” if and only if “Tad has a claim protected by 
Tim’s obligation to honor Tad’s claim.” This characterization of 
rights is central to one major tradition of understanding rights.? It is 
worth noting one further quite simple distinction here: Tad may 
have a legal right or a moral right against Tim. Tad’s legal claim, 
based say on a valid contract, would be protected by Tim’s legal 
obligation—for example, to deliver some commodity to Tad ona 
specified date. Tad’s moral claim would, in turn, be protected by 
Tim’s moral obligation—for example, not to use Tad’s private diary 


1 See for example, Michael J. Sandel, Liberalism and the Limits o, Justice (New 
York: Cambridge, 1982), esp. pp. 28-35; Carol Gilligan, In a erent Voice 
(Cambridge: Harvard, 1982), esp. pp. 100, 136f, 149, 164f, 174; Jo Hardwig, 
“Should Women Think in Terms of Rights?” Ethscs, XCIV, 3 ril 1984): 441- 
55; Robert Young, “Dispensing with Moral Rights,” Political , VI (1978), 
esp. pp. 67-8; Karl Marx, “Critique of the Gotha ” in The Marx Engels 
Reader, R. Tucker, ed. (New York: Norton, 1978), pp. 525-41. For a good discus- 
sion of the view that Marx’s view of the ideal community, the highest stage of 
communism, would forego or transcend rights, see Philip J. Kain, Marx and Eth- 
ws (New York: Oxford, 1988), pp. 12, 186, 200, 202f. 

2 See, for example, Joel Fei “The Nature and Value of Rights,” Journal 
of Value iry, IV (1970): 243-57; A. I. Melden, Rights and Persons (New York: 

ord, 1977), PR. ff; John Stuart Mill, Utilitarianism (Indianapolis: Hackett, 
1979), p. 52; W. D. Ross, The Right and the Good (New York: ord, 1930), p. 
50, Judith J. Thomson, The Realm of Rights (Camb : Harvard, 1990), esp. pp. 
37-60; Richard Wasserstrom, “Rights, Human Rig and Racial Discrimina- 
tion,” this JOURNAL, LXI, 20 (October 29, 1964) 628-41. 
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for his summer reading at the beach. To have a right against some- 
one in this general sense is simply to have a legitimate claim (either 
legal or moral or both) protected by their (legal or moral) obliga- 
tion(s). For the present, other complexities often deemed relevant 
to the analysis of rights talk (for example, powers, privileges, liber- 
ties) can be set aside. 

In this paper I shall talk of a claim protected by an obligation 
rather than one simply correlated with an obligation. I do so princi- 
pally because the notion of protection helps emphasize what is often 
regarded as the defensive posture represented by the concept of 
rights. The notion of “protection” is meant to do the same logical 
job as “‘correlation’”’—that of directly linking Tim’s claim to Tad’s 
obligation. This characterization, however, does suggest something 
about the nature of that link. Tad’s obligation is not merely joined 
indifferently (that is, simply correlated) to Tim’s claim; it offers 
some promise of protection in the case of a transgression. The prac- 
tice of claiming rights is clearly related to the protection of (or the 
justified attempt to protect) one’s interests. This use of the notion of 
protection is not put forth here as an independent argument about 
the idea of rights. Those who prefer ‘correlate’ to ‘protect’ can make 
the substitution without disturbing the present argument. I simply 
hope to use an account of ‘having a right’ which is coherent as well 
as sufficiently strong enough to grant credence to the initial prob- 
lem raised by many of the communitarian critics of rights.* The goal 
at this point in the argument is simply to avoid begging the question 
in favor of the conclusion—roughly, that friendship and rights are 
fully congenial ideals—by watering down the notion of rights (or 
friendship) so as to make the case for their compatibility essentially a 
foregone conclusion. 

Friendship is also a complex social and moral phenomenon. For 
the purposes of this paper the following characterization, roughly 
Aristotelian in inspiration, will be helpful to get the discussion 
started.* Jill and Jane are each other’s friends when both Jill’s and 
Jane’s own well-being has as one central constitutive feature the 
well-being of the other. As friends, both Jane and Jill have as one of 


* The notion of rights as claims is also important here because the centrality, 
and complexity, of the actruzty of lamang right calls for close investigation. Also, 
one must not confuse protecting the and protectung the friendshtp—the 
former may not always result in the latter. Claiming rights may only have a reme- 
dial effect on the fnendship, or it may, but it need not, diminish it. See sections 
III and IV below. 

4M Ethics, Terence Irwin, trans. (Indianapolis. Hackett, 1985), bks 
8 & 9, pp. 207-66. 
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their ongoing concerns the welfare and happiness of the other, and 
this concern for the other is, in large part, for the sake of the other. 
In the words of Lawrence Blum’s® recent account of friendship, 
friends have ‘‘a disposition to act to foster [the good of the friend] 
simply because the other is one’s friend” (ibid., p. 43). Friends at 
least share a disposition to engender those conditions which provide 
for the flourishing of both the friend as well as the friendship. Like 
Aristotle’s, this characterization of friendship has the mark of an 
ideal. So, it is best to note at the outset that it might not always (or 
often) be fully realized in what we commonly call friendships. We 
are discussing here what might better be called a true or a genuine 
friendship. 

A close-knit community, like a friendship, has a deeply mutual 
orientation. In a true community (again, an ideal notion) narrow 
self-interest is tempered by, indeed transformed by, a concern for 
the common good—that is, the well-being of the other members of 
the larger community. As between good friends, individuals within 
close-knit communities consider acts of self-sacrifice which are 
beneficial to the common good to be, in part, in their own (self-) 
interest. Indeed, the similarities between a community, as a group of 
persons whose own individual good depends on the good of others, 
and a friendship, leads to the plausible conclusion that a friendship 
is a kind of community. A friendship is perhaps a small, ardent, and 
especially intimate community. In this regard John Finnis® suggests 
that “the most intense form of community [is] the friendship of two 
friends” (ibid., p. 141). Finnis’s comparison, which does have some 
shortcomings, suffices for the present argument. True friendships 
share enough characteristic features with genuine communities— 
mutuality, a complex intermingling of self-interest, and some dispo- 
sition to promote the association itself—that it is reasonable to treat 
the case for rights between friends as relevant to questions about 
the compatibility of rights and community. 

Now, the problem here is not simply whether persons would enjoy 
better friendships without having recourse to rights, but whether 
there is some deep incompatibility between the kind of community 
promoted by true friendship and the kind of friendship it is possible 
to coordinate, at least in part, with rights. In short, having—and 
exercising—rights seems most appropriate in: (1) an individualist 


7 Trine Aea and Morality (New York: Routledge, 1980). 

6 Natural and Natural Rights (New York: Oxford, 1980). Aristotle’s ac- 
count of fnendship also, of course, links characteristic features of tical com- 
munity and fri ip, Nicomachean Ethics, bk. 8, pp., 207-32; Polztics of 
Aristotle, Ernest Barker, trans. (New York: Orford, 1979), bk. 1, pp. 1-38; bk 3, 
p 120; bk 4,p 181. 
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atmosphere; (2) an adversarial atmosphere (rights within friendships 
would be rights against friends); (3) an egoistic atmosphere, where 
sentiments of selfishness run high. Individualism, adversarialism, 
and egoism are arguably antithetical to the values and the virtues of 
friendship which are basically communal, cooperative, and often 
deferential to others. The first claim that the same legal and moral 
rights held against adversaries hold also between friends seems to 
lead to a second possibility, that there are in addition even further 
moral rights specific to friendship itself. But for some of the reasons 
just outlined this idea of rights specific to friendships also may seem 
deeply problematic. Finally, what one is able to achieve with rights is 
clearly not the best that friendship has to offer. For example, to 
secure help from a friend by a claim of right is hardly to enjoy 
friendly help. These problems do indicate that while this inquiry may 
well be relevant to the compatibility of individual rights and the 
ideals promoted and protected by close-knit communities, the 
question—Are there rights between friends?—is also of intrinsic 
interest. 
n 

One criticism of the idea that there are rights between friends has 
been made famous by Michael Sandel. He argues that friendships, 
and other close-knit communities, diminish in moral value when 
they are viewed as susceptible to considerations of justice and claims 
of rights. Noting that “justice is the first virtue of [only] certain 
kinds of societies,” Sandel suggests we “consider a more or less ideal 
family situation, where relations are governed in large part by spon- 
taneous affection” and “[i]ndividual rights . . . are seldom invoked, 
not because injustice is rampant but because their appeal is pre- 
empted by a spirit of generosity ...” (op. cit., pp. 30, 33). Sandel 
then describes how this ideal community “comes to be wrought with 
dissension” and the “affection of previous days” gives way to “the 
observance of rights” (op. cit., p. 33). Sandel’s point that “an in- 
crease in justice does not necessarily imply an unqualified moral 
improvement” seems to be, in part, that having and observing rights 
is itself destructive of close-knit community (op. cit., p. 32). 

Sandel’s overall argument, however, seems to conflate having 
rights and claiming rights, at least suggesting that claiming rights 
follows inevitably from having rights. Yet, clearly, having rights does 
not invariably entail the particular action, or even the general prac- 
tice, of claiming them. One might have rights without ever claiming 
them—for one, when they are never violated. One may also have 
them but not claim them even when they are violated—if, in the 
judgment of the rights holder, making this particular claim, though 
warranted, is not the most reasonable course of action given other 


472 THE JOURNAL OF PHILOSOPHY 


considerations. Sandel is clearly unable to show that merely having 
rights—within the best of close-knit communities, they will likely 
operate largely as unneeded reminders—is any real challenge to the 
overall orientation of a community. 

Perhaps, then, the crux of Sandel’s problem is with the practice of 
claiming rights. The examples he offers of the claiming of rights, 
however, hardly form a reasonable test case for the evaluation of the 
effects of claiming rights (or the practice thereof) on close-knit com- 
munities. This is because Sandel’s paradigm case of claiming rights 
between friends is, by his own admission, a case of inappropriate 
claiming of rights. When Sandel says “it can also be shown that in 
some cases justice is not a virtue but a vice,” he somewhat suspi- 
ciously avoids cases displaying any of the true virtues of rights (for 
example, providing a response to unjust intrusions). He offers in- 
stead an inapposite comparison—misguided claims of right against 
mistaken foes (op. cit., p. 34). Sandel notes in this regard that “[iJf, 
out of a misplaced sense of justice, a close friend of longstanding 
repeatedly insists on calculating and paying his precise share of 
every common expenditure,” I shall “feel compelled to be recipro- 
cally scrupulous,” and “may begin to wonder whether I have not 
misunderstood our relationship” (op. cit., p. 35; my emphasis). 

Surely, a reasonable understanding of such a case is that one may 
well have misread the nature, or at least the depth, of the friendship 
in question. The “friend of longstanding” may for various reasons 
be less friendly than one thought. But the natural conclusion here is 
not, as Sandel seems to suggest, that the claiming of rights by itself 
was the reason for the distance in the friendship. Rather, it is the 
friend’s decision to use rights in this way (perhaps intentionally 
“misplaced” to convey dissatisfaction) which has done so. The key 
point is that in Sandel’s example the fact that the claiming of rights 
was misplaced is clearly more important to the dissolution of the 
friendship than the fact that rights were claimed. Likewise, in San- 
del’s other example when “the harmonious family comes to be 
wrought with dissension,” there is no indication that the claiming of 
rights is in any way related to the original diminution of family 
feelings. Here again, it can hardly be said Sandel has shown that 
claiming rights is a “vice,” if there is good reason to claim them. 
Curiously, Sandel never probes the cases necessary to demonstrate 
the point he sets forth to make.” 


7 It is not my point here to deny that ing rights can ever have a causal 
effect on the deterioration of a close-knit relati ip. On that limited point 
Sandel ıs no doubt correct. He has simply not shown—though he seems to assert 
—that the use of rights in such cases ıs typically the main cause of unfriendly 
relations. 
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Noting the difference between having rights and claiming rights 
helps alleviate other possible confusions about the existence of 
rights between friends. One might be said to have human rights, or 
legal rights, against friends without ever choosing, or having good 
reason, to claim them. Take what are sometimes called in rem rights 
—rights not held against particular individuals, but held, as it were, 
against the world at large (or the relevant subset thereof). Take, for 
example, the human right not to be tortured or the legal right to 
free speech. By noting that one does have such rights against all 
others, this does not exclude one’s friends from having the corre- 
sponding protective duties. Having justification to claim such a fun- 
damental right against a friend would, however, reflect negatively 
on the friendship. There is every reason to think that Jill’s friendship 
with Jane is somewhat strained if Jill knowingly facilitates those who 
would torture Jane. Improbable hypothetical scenarios might even 
be generated here—in a misguided attempt to help, Jill might facili- 
tate Jane’s torture at what she supposes would be Jane’s request 
because Jill thinks this may be the best way to save Jane’s children. 
Here even a well-intentioned, though confused, friend can at times 
be property constrained by the consideration that all persons—even 
friends—have human rights. 

Similarly, Tad’s right to free speech binds his friends and enemies 
alike. All relevantly situated persons, be they friend or foe, have the 
duty to respect his right. Clearly, Tad’s friends, qua friends, are 
most unlikely to run afoul of this right. This fact does not, of course, 
entail that his friends have no duties, only perhaps that these are 
duties which they have no real inclination to avoid.® Consider, as 
well, what are called in personam rights—claims where the protec- 
tive duty is held by a particular person, like the rights deriving from 
a promise or a contract. Here also, friends may have rights against 
other friends. Indeed, there may sometimes be good reasons for 
friends to have rights based in promises or contracts, perhaps rea- 
sons based principally on considerations of friendship itself. Con- 
tracts between friends may at times actually be the most amicable 
way to protect the closeness of that relationship from the pressures 


of other less friendly concerns. 
Il 


Now, even if actually having and claiming rights is compatible with 
the values and virtues of friendship, this does not yet fully solve the 


* Surely it is wrong to allege (as some readings of Kant seem to suggest) that 
one does not haue an obligation simply because one does not in fact experience 
that obligation as a real limitation on, or impediment to, one’s present inclina- 
tons. There is certainly no tnconststency in the idea that one has duties (e.g., some 
familial duties) that one is really quite happy to carry out. 
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problem. One further complication that may be raised by some crit- 
ics of rights talk is that the very idea that one has rights (not the fact 
of having them or claiming them) will establish a distance that is 
destructive of friendship. It may be argued that, like the sense of 
justice, the sense that individuals have rights somehow puts them on 
less than friendly terms with other members of close-knit communi- 
ties. One central issue here is the individualism inherent in the no- 
tion of individual rights. The point is that in some fundamental 
sense the idea that persons have, and may claim, individual rights 
puts the individual at center stage, and thereby inherently at odds 
with others. In short, this emphasis on the individual’s claims has a 
corrosive effect on the community of friendship. The idea of individ- 
ual rights, however, is compatible with the central values and virtues 
of friendship. 

Various aspects of individualism which are prominent because of 
this attention to the idea of individual rights deserve mention.’ First, 
individual moral rights provide individuals with a certain secure 
moral standing. As Richard Wasserstrom notes, claiming something 
as a right “is crucially different from seeking it through the grant of 
a privilege, the receipt of a favor or the presence of a permission” 
(op. cat., p. 630). Self-consciously possessed of rights, individuals 
(even individuals qua friends) can demand that they be treated with 
a certain (minimal) respect. This, however, does not imply that per- 
sons never need favors or indulgence from their friends. It does 
suggest that in some cases, perhaps where the friendship is suffering 
trying times, people need not beg or plead with friends for fair 
treatment. The idea that a person has individual rights—even 
against friends—provides for a certain powerful sense of self-worth 
and one typically not provided without individual rights. 

This sense of self-worth does not, however, necessarily lead to the 
use of those rights in ways destructive of friendship. As noted above, 
a person may have rights and even have them violated and yet 
choose for the moment (for various reasons, including consider- 
ations of friendship) not to claim those rights. In this regard, notice 
also the difference between: (1) actually claiming a right, (2) threat- 
ening to claim a right, and (8) noting that you feel a right has been 
violated.° Between friends, the mere observation that one is in a 


° A most useful account of various versions of individualism inherent in rights 
talk is given by eremy Waldron in Nonsense upon Sulis: Bentham, Burke and 
Marx on the Rights of Man (New York: Methuen, 1987), esp. pp 185-8. 

10 Of course, one may relinquish one’s nghts in various ways (e.g, waiving 
them). For a similar distinction between asserting and withholding a right and the 
subsequent effects of doing so on community, see John Tomasi, ‘‘Indivi 
Rights and Community Virtues,” Ethics, c1 (1991): 525-7. 
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position to claim a right, without yet actually having, or planning, to 
do so can lead to pointed discussion but not necessarily an adversar- 
ial confrontation resulting in the dissolution or diminution of the 
friendship. One possible characteristic virtue of friendship (and 
close-knit community) is the mutual willingness to overlook some 
minor conflicts, or at least the standing inclination to foresee their 
amicable solution. This virtue, however, does not preclude the very 
idea that friends have individual rights. It simply suggests one 
proper framework—a friendly way—for using individual rights, that f 
is, one preserving or even enhancing the friendship.”’ In addition, a 
proper sense of self-worth (not full-blown, narrowly construed 
egoism—which will be discussed below) is arguably a precondition 
for, or at least a desideratum of, friendship itself.’* A proper sense of 
self-worth is not inconsistent with the virtues of friendship. Clearly, 
habits of self-respect inspired by rights can encourage such virtues 
among one’s friends.’° 

A further aspect of individualism that might be seen as destructive 
of the grounds of friendship is what could be called atomism. In this 
case, atomism is the view that the individual is isolated from the 
larger close-knit community by his possession of individual rights. 
“Liberals” like John Rawls"* have criticized classical utilitarianism 
because it “does not take seriously the distinction between persons” 
(ibid., p. 27). In a sense the problem raised by “communitarian” 
opponents of rights is that “liberal” individuals take altogether too 
seriously the distinction—and thereby the difference—between 
persons. Embracing the ideas of rights, and justice, has led individ- 
uals to ignore or downplay their communal similarities. Friends, and 
close-knit communities in general, value and promote the similari- 
ties, not the differences, between individuals. John Hardwig, a femi- 
‘nist advocate of close-knit community, suggests: 


Thinking in terms of rights thus does more than reflect an egoistic, 
atomistic situation; it creates such a situation ... Because thinking in 
terms of rights rests on an atomistic picture of us as separate, thinking 


11 On the idea that the way individuals use their nghts is itself a moral issue, see 
Daniel Callahan, Setteng Limits (New York: Touchstone, 1988), pp. 176f, 
“Dignity, Rights and Self-Control,” Ethics, xcrx (1989). 526-8; Waldron, “A 

ight to do Wrong,” Ethics, xcu, 1 (1981): 21-39. 

Cıcero holds that a type of self-sufficiency is necessary for friendship. See 
“On Friendship,” in Other Selves: Philosophers on Friendship, Michael uk, 
ed. (Indianapolis: Hackett, 1991), pp. 79-116. 

13 This is not, of course, to imply that friendship is not itself a powerful source 
of a sense of self-worth. It is The point is simply that a person’s sense of self- 
worth is quite complex and has various sources. 


14 A Theory of Justice (Cambridge: Harvard, 1971). 
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in terms of rights systematically denies the unity, the togetherness, the 
“we” that we are trying to create (op. cit., p. 449; my emphasis).'° 


In light of such an argument, it is worth recalling that even the 
closest of friends are in fact separate in some important ways. In- 
deed, some recognition of these differences is valuable to under- 
standing fully the ideal of friendship itself. Friends, of course, in- 
habit separate bodies. Each friend also has unique perceptions and 
ideas, attitudes and emotions. These simple facts alone account for 
some significant differences between even very close friends. On 
occasion good friends differ over fairly basic matters: what each 
believes is beautiful, amusing, exciting, perhaps even sometimes 
what is required by morality. Most poignantly, friends may even 
disagree about what is called for by friendship itself. 

It is also true that friends are in many ways quite similar in their 
experience and evaluation of the world. Aristotle suggests that in 
the highest form of friendship the virtues of the good man are 
mirrored back to and reinforced by his friends.!® Yet what makes 
this experience of mutual support especially valuable for friends is 
the recognition and enjoyment of the fact that each individual per- 
son is part of that friendship as a distinct, unique person. Leaders 
with sycophantic advisors tend to get less than the best advice. Hav- 
ing one’s views reinforced by someone who cannot do otherwise— 
very young children are at times related to their parents this way— 
is, in truth, considerably less reassuring than genuine autonomous 
support. 

As friends, Jane and Jill may well be more similar, in some funda- 
mental ways, than they are different. Yet their distinct individuality 
adds to, rather than detracts from, the quality of their friendship. It 
may even enhance their experience of that friendship. In a world of 
strangers, friends are especially esteemed because their differences 
from us do not estrange us from them. These inevitable differences 
between friends also shape the process of becoming friends. The 
gradual integration into any friendship proceeds from the fact of 
prior differentiation. Deepening friendships progress with the 
growing appreciation that the friend has considerable life experi- 
ence prior to, and outside of, the friendship. In general, discovering 


18 For one account of why some feminists have a similar antagonism to rights as 
“an idiom associated with masculine concerns,” see Waldron, Nonsense upon 
Stilts, pp. 159-60. 

18 Nicomachean Ethics, bk. 9, esp. chs. 4 & 8, pp 245-8, 253-6 
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friendly similarities in other individuals does not require ignoring 
the very real differences within close-knit relationships.'” 
IV 

Finally, the idea of individual rights will be seen by some as an 
independent source of egoism. From the point of view of friendship, 
the divisive passion characteristic of egoism is selfishness. For pres- 
ent purposes selfishness, or narrow self-interest, can be character- 
ized as an overriding concern for one’s own self-interest without any 
(or proper) concern for how this affects others. Jeremy Waldron 
notes that Bentham claims “that the encouragement of the selfish 
passions is ‘the perpetual and palpable object of a Declaration of 
Right’.”"!® But is crass selfishness a product of the very idea that 
individuals possess rights? While the use of rights in any particular 
case may not be wise or prudent, Waldron notes that: 


A theory of rights leaves agents needing more in the way of moral 
guidance, not because rights are egoistic . . . but because rights offer no 
guidance at all—egoistic or otherwise—to those who have them... 
Certainly rights aim to weaken some of the bonds of social life, but that 
is to be construed as the promotion of egoism only by those who con- 
sider community essentially bondage . . (ibid., p. 195) 


Waldron’s observation helps explain how the idea of rights between 
friends may avoid the various pitfalls of an unfriendly selfishness. 

The idea of individual rights allows for and in some cases may 
require a discussion of their proper, or even their optimal, use in 
particular cases. The use of rights can be, and between friends 
should be, guided by what might be called communal values. For 
example, one of the values protected by rights between friends 
would be mutual trust. Tad has a right that Tim not furtively read 
Tad’s private journal even though Tim may quite innocently want to 
know his good friend even better. This right protects the enjoyment 
of a range of values. It protects Tad’s individual privacy. It also 
helps secure the ongoing trust between them as friends. 


17 A, I. Melden, Rights in Moral Lrves (Berkeley: California UP, 1988), pp. 124, 
126, is one of a few recent writers at least to mention rights between fends; cf. 
also Neera K. Badhwar, ‘The Circumstances of Justice: Pluralism, Community, 
and Friendship” (forthcoming in The Journal of Political Philosophy, I). Henry 
Sidgwick, The Methods of Ethics (London: , 1907), pp. 248-50, 256-9, 
has an msightful discussion of dutes (if not yet rights) between friends, as does 
Kent Greenawalt in “The Right to Silence and Human Dignity,” in The Constitu- 
tion of Rights: Human Dignity and American Values, Michael J Meyer & William 
A. Parent, eds. (Ithaca: Cornell, 1992), pp. 192-209 

18 Waldron, Nonsense upon Stilts, p. 191; the entire quotation from Bentham is 
on p. 48. Waldron has a good account of related senses of selfishness, pp. 191- 
201. 
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We can distinguish, roughly, between personal values (states of 
affairs valued independently of one’s relation to others—e.g., indi- 
vidual physical security) and communal values (those valued because 
of one’s membership in some larger community—e.g., mu- 
tual trust).’* An egoist, having little or no concern for how his self- 
interest affects others, will be unable to appreciate communal val- 
ues, except perhaps instrumentally. As a good friend, however, Tim 
would value Tad’s privacy for Tad’s sake. While an unfair invasion of 
Tad’s privacy by a third party would hurt Tad, it would thereby, as 
his friend, hurt Tim as well. But Tim’s interest in proper respect for 
Tad’s privacy is also rooted in his concern for mutual trust within 
the friendship. The trust destroyed by Tim’s disrespectful invasion 
of Tad’s privacy will likely be a much deeper violation than that of 
any stranger. This violation hurts both Tad and Tim; it also hurts 
their friendship. It damages an essential mutual trust. To suggest 
that this disrespectful invasion of privacy harms their friendship is to 
say that it harms Tad and Tim as friends. As a friend one has an 
individual stake in another’s personal values. One also has a stake in 
the communal relationship itself—that is, in part, one has a stake in 
one’s own position as a friend. 

Yet Waldron’s claim that there are “many types of human relation 
where standing on one’s rights would be inappropriate” seems over- 
stated.” All that is proper to note here is that there clearly are times 
within some close-knit communities where it would be wrong—that 
is, inappropriately egoistic or in some way misguided—to do so. 
After Tad and Tim have agreed to share private journals, and after 
Tad has already shared his own, then it might well be wrong for Tim 
to raise the issue of his rights to privacy.”! The simple point is that 
friendships should not be viewed as a special kind of relationship 


1 For interesting but more elaborate accounts of ‘‘communal values” than it is 
necessary for present purposes to explicate, see Gerald J. Postema, ‘Collective 
Evils, Harms and the Law,” Ethics, xcvi, 2 (1987): 41 0; and Mark Sagoff, 
“Values and Preferences,” Ethics, XCVI, 2 (1986): 301-16. 

™ Waldron, Nonsense upon Stilts, pp 188-9 (my is on ‘types’). It is 
unclear why Waldron makes this point given that he says ia 191-2) that to have 
a nght is neither to be protected from all moral cnticism nor a seal of moral 


! Even here it may be a difficult case The problem ıs not that Tim has no rights 
to privacy because he waived all of them as part of his agreement to share jour- 
nals If Tim comes to a reasonable decision that by sharing his journal Tad has not 
revealed himself at all—perhaps his journal is especially prosaic and reticent— 
Tim may be tempted to reassert that his own right to privacy exempts him from 
the claim against him that he share it with Tad. The actual assertion of such a 
claim may well compromise the fnendship with Tad, but if it comes to that, Tim 
may already have been put off by Tad’s meager (or manipulative) attempt at 
openness 
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which eschews rights talk. Rather, friends require a special sensitiv- 
ity to the reasons provided for various uses of rights. 

Many of these aspects of individualism outlined in the last three 
sections, and thought to be inherent in the idea of individual rights, 
may appear to culminate in the prospect of adversarial relations 
between friends. Rights between friends are, of course, rights held 
against friends. Yet the potential adversarialism inhering in the idea 
of rights exists only insofar as their actual use has that intent or 
outcome. As argued above against Sandel, the adversarial use of 
rights is not an inevitable consequence of having rights, neither is an 
adversarial posture somehow inherent in the idea of rights. Now 
clearly, a theory of what rights one has should not be taken as a 
complete account of one’s moral relations. Equally clearly—but less 
often recognized by proponents of individual rights—an account of 
individual rights should be supplemented by a theory of virtue. This 
is necessary because the idea of oneself as possessed of certain rights 
falls short of thoughtful guidance on their use in specific circum- 
stances. The idea of rights would be greatly benefited by an account 
of their reasonable use in particular contexts. Claiming rights can be 
modulated by a commitment to many values, not the least of which 
are those of friendship and community.” In this way it can be ap- 
preciated that the idea of individual rights between friends need not 
be the path to unfriendly individualism. 

v 


One important final question exists about the character, or the qual- 
ity, of any friendship that one regulates, in part, with rights. It might 
be said that the essence of some communal values, like mutual trust 
between friends, cannot be maintained by claims of right. The prob- 
lem can be put this way: what friends want out of any friendship is, 
in its very nature, diminished or destroyed, if it is secured through a 
claim of right. In short, from a friend one wants spontaneous, not 
grudging, concern. The quality of the concern received, if one can 
claim it as a right, is simply not the same as if it is spontaneously 
given.” Or, more strongly, if concern is got by claiming it, the es- 
sence of what is really desired—spontaneous care—is simply ef- 
faced. Two points about this observation are worthy of attention. 


2 Waldron, Nonsense upon Stilts, p. 196. 

™ This point was first pointed out to me in this context by Badhwar. On this 
question of Haag cf. her ‘Circumstances of Justice: Pluralism, Community, 
and Friendship,” pp. 27-9 (in manuscript) See also Bernard Williams, ‘‘Persons, 
Character, and Morality,” in Moral Luck (New York: Cambndge, 1981); and Mar- 
cia Baron, “The All Moral Repugnance of Acting from Duty,” this JOURNAL, 
LXXXI, 4 (April 1984): 197-220. 
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First, to say that a friendship is regulated, in part, with rights is 
not to say that every aspect of the friendship is a matter of claiming 
rights. It has not been my argument here that rights are always the 
only, or the best, way to regulate a close-knit community. A very 
good friendship will transcend the need to claim rights most of the 
time. And a perfect friendship might well always forego the claiming 
of rights. But not all friendships—sometimes perhaps not the best 
of them—are perfect. Even between close friends of longstanding, 
avoiding certain potential transgressions may require respect for the 
friend’s independent status, and at least her possession of some 
rights. If one fully recognizes the independent status of a friend, 
then she will not be seen primarily as someone living to fulfill all of 
one’s own desires. In the best of friendships there is no direct harm 
to the friendship, or the quality of the concern thereby derived, to 
see the other as a bearer of rights, since this status alone entails no 
actual claims. Here, spontaneous concern is not destroyed by claim- 
ing it as a right, because no actual claim is ever made. 

Second, to note that receiving respect by a claim of right (or out 
of regard for the possession of rights) is not the best that friendship 
has to offer is not yet to show that such respect is no good at all, 
even between friends. Jane may feel that, if in some cases Jill re- 
spects her privacy only after it is claimed as a right, their friendship 
has a long way to go to suit Jane. Of course, that is not to say that 
Jane would rather have her privacy violated than claim respect for it. 
Neither does it show that in such a case Jane would necessarily wish 
to have never become, or no longer remain, Jill’s friend. Even be- 
tween friends it is sometimes better to enjoy certain goods, like 
respect, by claiming them, than to suffer their loss or to suffer the 
loss of the friendship.%* Securing respect between friends, even as 
an afterthought of sorts, can be better than not securing it at all. 
The obvious desirability of faultlessly spontaneous respect and con- 
cern notwithstanding, from the point of view of friendship, second 
best is not utterly unacceptable in every case. What clearly is unac- 
ceptable between friends is a steady diet of antagonism and selfish- 
ness. While it seems that only unusual friendships—if any at all— 
survive persistent egoism, rights between friends, properly under- 
stood, militate against egoism. Even the most well-meaning of 
friends may, at times, fail to accord proper respect to a friend. Any 


* What may happen in such a case is that between friends the scope of a right 
to privacy sometimes becomes less clear. This may also be a case w a right 
need not actually be claimed to secure the respect in question. While even thi 
compliance because of the recognition of but not the claiming of nghts falls short 
of the spontaneous respect often expected from frends, it goes too far to say that 
here the respect 1s destroyed. It is diminished, but diminished respect (even from 
friends) may be better than the loss of the respect or the friendship. 
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view of friendship that rules out the very possibility of insensitive 
disrespect or conflict between friends seems at best a romantic, at 
worst a wholly unrealistic, view of friendship. 

VI 


If there are rights between friends—especially rights specific to 
friendship itself—What are they? I shall mention only a few rights 
here so as to initiate this discussion. These could be taken as possible 
paradigm cases of rights that hold between very close friends. In 
addition, what would be taken as a good reason to claim such rights 
in individual cases is, in part, a function of the character, the depth, 
and the nature of the particular friendship. This being the case, little 
can be said outside of more specific contexts about when and how it 
would be reasonable, or reasonably friendly, to exercise such rights. 
Finally, nothing argued so far should be taken to show that one has 
a right to a particular friendship (or, for that matter, friendship 
in general). The rights delineated here arise out of existing 
friendships.” 

I have already discussed rights to respect and concern between 
friends, especially respect for some level of privacy between friends. 
But there is nothing unique to friendship about privacy (that is, one 
expects to be accorded some privacy—likely even more—from 
strangers). So, take as a further example a right that is arguably 
unique to friendship and the close relationship it fosters. Between 
very close friends there is a right to be treated openly within the 
relationship. It is important to distinguish this claim to be treated 
openly from the claim to be treated honestly. Although openness 
and honesty sometimes have mutually reinforcing effects, openness 
goes beyond a simple willingness to tell the truth. It is a disposition 
to be forthright about it. Among close friends the virtue of openness 
is a general disposition to be forthcoming with some of the truth, 
especially about oneself. One is hardly expected, in this sense, to be 
open with strangers or even mere acquaintances. But some willing- 
ness to communicate openly one’s own concerns, hopes, joys, and 
fears is essential to the type of sharing one comes not only to desire 
but to expect—especially when one is open oneself—from very 
close friends. Such openness is not, however, to be mistaken for a 
confessional disposition. To meet legitimate expectations of mutual 
openness, Jane need not always bare her soul to Jill. The desirability 
of this one aspect of intimacy does not entail that even very close 
friends need be intimates at all times or in every way. But certainly 
the ongoing deepening of a close friendship requires, from the 


™ To say that such rights derive from the commitment (at least tacit or hypothet- 
ical) already made within existing friendships is obviously not a full account of 
how such special rights arise. That topic must be the subject of another paper 
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point of view of each individual friend, a commitment to some 
openness and intimacy.” 

Between close friends one also has a right to concern and respect 
for one’s own sake. A general disposition to respect such a claim 
seems to be another virtue of friendship—the virtue of mutual altru- 
ism. This regard for one’s friend for her own sake, and not for the 
sake of someone else, is also one of the core values of friendship. To 
appreciate this particular claim between friends one need only rec- 
ognize the strong sense in which one should regard one’s true 
friends as unique. In other words, it violates the proper respect due 
to a close friend simply to replace her friendship with the friendship 
of someone else (to believe that one’s friends are fungible, easily 
exchanged like a tourist swapping foreign currencies). Similarly, it 
violates any friend’s basic rights to value her in order to enhance a 
friendship with someone else. None of this is to say that it is inappro- 
priate to have a wide range of friends or share different aspects of 
oneself with different friends. It is simply to note that it is too weak 
to say that Jane would be put off if she discovered Jill was her friend 
primarily as a way to get to know Tom better. Jane, and her claim to 
basic respect as a friend, would be violated in a deep way if Jill was 
her friend not primarily because of anything about Jane, but for the 
sake of Tom’s friendship. True friends have a right to be treated not 
as simply useful to other goals, or as generic but fungible place- 
holders, but as real and unique people. Part of what is deeply im- 
portant about friendship is being cared about because of (and some- 
times in spite of) the singular and distinctive person that one is. 

Between close friends other rights include a right not always to be 
asked to place a friendship before one’s other needs, including 
other friends. Friends have a right that their closest and most 
friendly relations not be all consuming all of the time. Between 
close friends one also has the right to the benefit of the doubt in 
cases of conflict. This is not to suppose that friends are never inap- 
propriately egoistic but only that a close friend has a right that this 
not immediately be thought to be the cause of a conflict. The obser- 
vance of the existence of such a right helps put conflicts between 
friends in an appropriate perspective. Between close friends each 
person also has a right to roughly equal influence on the friendship. 
Observing such a right reflects an essential commitment to compro- 
mise. In fact, the spirit of this virtue of compromise underlies the 
likelihood of the general observance of all the rights between friends 
mentioned so far. Whatever kindness one shows to a close friend, 
one can usually expect him to be disposed to return it in full. Of 

7 It 1s not my point here that every failure of o es or int is best 


described as a violation of rights, but only that a complete and unilateral failure to 
be open might well be so described. 
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course, such equal influence is never perfect, but failing to honor 
the claim to it not only harms the friendship but fails to accord 
proper respect to the individual friend in question. 

In conclusion, it must be emphasized that any account of rights 
between friends is clearly not all that there is to friendship. Not 
everything one has reason to give to friends is owed as a matter of 
rights. This indeed would be an impoverished view of friendship. It 
has not been argued here that friendship is fundamentally right- 
based—that is, neither the practice of friendship nor the basic 
moral impulse that gives rise to it can be reduced to the observance 
of rights.?” Close friends will spontaneously go to great lengths for 
one another; and they will do so without either claiming rights or 
giving any thought to the existence of rights. But this happy fact 
about friends ought not tempt us to embrace a utopian account of 
friendship which puts friends beyond the possibility of conflict. 
Clearly, such a fanciful vision would fail as a suitable account of 
actual friendships and close-knit communities. 

Equally clearly, it is deeply misguided to suggest, as some oppo- 
nents of rights in close-knit communities do, that “if you are my 
friend .’. . [you can invade my privacy, . . . fail to respect my private 
property, verbally abuse or perhaps even physically assault me, and 
it is all right so long as I know that you are my friend.”** No individ- 
ual need mutely suffer harm and abuse from strangers or friends. 
The virtues of self-respect offered by individual rights need not take 
a back seat to the excesses of some close-knit relationships. The 
fundamental mistake of seeing the rights of friends as antagonistic 
to the virtues of friendship obscures our vision of the compatibility 
of individual rights and close-knit community. Even beyond this 
compatibility, thinking about rights between friends may help us 
both understand something about the place of individuals within 
close-knit communities, and clarify the idea that there are reason- 
able, and unreasonable, uses of some rights. At times to protect 
themselves, and the friendship, individual friends—qua friends— 
must be positioned to go beyond merely explaining what they need. 
Rights provide one good, and sometimes proper, way of doing so. 
If, between friends, we have reason to take rights seriously, that 
does not mean we take friendship any less so. 

MICHAEL J MEYER 
Santa Clara University 


Y! Nor has it been claimed here that the virtues of friendship are of instrumen- 
tal value only—that is instrumental to the Ratan ae See J Raz, 
“Right-Based Moralities,” in Theories of Raghts, Waldron, (New York ord, 
1984), pp. 182-200. 

™ Hardwig, p 56 
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RATIONALITY AND THE PRISONER’S DILEMMA IN 
DAVID GAUTHIER’S MORALS BY AGREEMENT* 


n Morals by Agreement, David Gauthier! defends “the tradi- 
I tional conception of morality as a rational constraint on the 

pursuit of individual interest” (MA, 2; my italics). His defense 
is set against the background of a prominent view of rational con- 
duct, the view that Gauthier calls “the utility maximizing conception 
of rationality” (MA, 6f). The maximizing conception in its standard 
interpretation holds that 


an act of a person is rational if and only if it is expected to maximize 
p’s own interest or preferences.” 


This principle codifies the idea of immediate connection between 
rationality and the direct pursuit of individual interest—the idea 
that “there is a connection between [practical] reason and interest 
—or advantage, benefit, preference, satisfaction, or individual util- 
ity” (MA, 6). 

The traditional conception of morality Gauthier attempts to de- 
fend is clearly at odds with the maximizing view of rationality. The 
conception of morality does not deny the apparent fact that morality 
may and often times does conflict with the direct pursuit of individ- 
ual interest; and it holds that in some cases of such conflicts it is 
rational to comply with moral commands at the cost of one’s own 
interest or preferences. Thus, its defense requires justifying why it is 
rational at all to comply with moral commands even when they re- 
quire sacrificing one’s own interest. 

Yet Gauthier wants to base his argument on the maximizing con- 
ception of rationality itself. He somehow thinks that the maximizing 
conception, in one version or another, is “the only one capable of 
critical examination” (MA, vi). Whether or not one agrees with this, 
his attempt, if successful, would result in a strong and desirable 
defense of morality. The opponents of (the rationality of) morality 
typically adhere to the maximizing conception;? so the justification 


* I am grateful to David Gauthier, Gregory Kavka, and Philippa Foot for their 
comments on the earlier versions of this : 

! New York: Oxford, 1986. Hereafter 

*Cf. MA, 6 “a person acts rationally if and only if she seeks her greatest 
interest or benefit.’ 

* In particular, I take Gauthier’s opponent, Hobbes’s Foole (cf. fn. 8), as an 
avowed promulgator of what he takes to be the correct implication of rationality 
as maximizing of individual interest. 
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of morality in terms of their own conception of rationality, if avail- 
able, will turn out to be the most conclusive argument against them. 

The strategy of his argument might be best explained with an 
analysis of the prisoner’s dilemma. Recall a standard description of 
the (two-person) prisoner’s dilemma, the possible outcomes of 
which are summarized in the following table, where numbers indi- 
cate the years the two suspects, he and I, will live in jail:* 








He 
í, he) Confess Not 
I Confess (5, 5) (1, 10) 
Not (10, 1) (2, 2) 





Table 1 


In this situation, adherence to the maximizing rationality by both 
agents—i.e., their unrestrained pursuits of their respective individ- 
ual interests—yields them a suboptimal outcome: 5 years in jail for 
each. If, however, they can effectively cooperate, they can reap a 
better outcome of two years in jail for each, an outcome which is fair 
and more advantageous to each than that of their individually “‘ratio- 
nal” actions. 

Gauthier’s approach is to view morality as a code agreed upon to 
enable a fair and mutually advantageous social cooperation, which, 
as the prisoner’s dilemma is apt to show, may sometimes be at vari- 
ance with the direct pursuit of individual interest. To justify the 
constraint that the contractarian morality imposes on the direct 
pursuit of individual interest, Gauthier argues that when the agents 
in a situation characterized by ‘‘a conflict as well as identity of inter- 
ests” (ibid., p. 126) (such as the prisoner’s dilemma) attempt to 
reach the optimal cooperational outcome on the basis of the maxi- 
mizing conception of rationality, they find it rational (in the maxi- 
mizing sense) to agree not to confess. Furthermore, he argues, they 
find it rational to form a disposition to abide by the agreement so 
made—given certain provisos. That is, straightforward pursuers of 
individual interests (‘‘straightforward maximizers”’) have the reason 
(by their own standards) to change into “constrained maximizers,”’ 


4 Here I shall not describe the pnsoner’s dilemma I just refer to the descrip- 
tion ın MA, 79-82. See also John Rawls, A Theory of Justice (Cambridge: Harvard, 
1971), p 269. 
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namely, those who restrain the pursuits of their respective interests 
in terms of, say, the contractarian morality. 

In order to elucidate the nature of his argument, I shall follow 
Gauthier in concentrating on the analysis of what it is to act ratio- 
nally in the prisoner’s dilemma. There I shall attempt to show why 
the rationality of (the act of) forming a disposition is one thing and 
the rationality of acting in conformity with the disposition quite 
another. Thus, I argue, in justifying the rationality of complying 
with morality viewed as a code enabling social cooperation, Gauthier 
has as yet to vindicate the principle connecting the two: 


If it is rational to be disposed to restraint in the direct pursuit of 

individual interest, then it is rational to act in accordance with the 

disposition. 
Finally, reflecting on the implications of adopting this principle, I 
shall argue that Gauthier’s justification, properly understood, turns 
out to presuppose, rather than to derive, that there exists an implicit 
constraint on the direct pursuit of individual interest in what he 
should argue is the proper interpretation of “the maximizing con- 
ception” of rationality. 


I 

Stage I. Recall table 1 of payoffs proposed to two prisoners, he and 
I. Given the maximizing conception (in its standard interpretation), 
it is rational for me to confess under that circumstance. In this 
context it is convenient to talk in terms of the expected years during 
which I shall live in jail depending on whether I choose to confess or 
not—assuming that the longer the expected years, the less the ex- 
pected utility (alternatively, the less it is in my interest or the less I 
prefer it). Now, depending on whether or not I confess, the mea- 
sure, Y, of expected years in jail may be calculated as follows: 


Y (confess) =5Xp+1X(1-p) 
Y (not confess) = 10 X p + 2 X (1 — p) 


where p (0 < p <) is the probability that I initially assign to the 
chance that he will confess. Now, Y (not confess) is greater than Y 
(confess) whatever the initial probability, p, of his confessing may 
be;> for 10 X p is greater than 5 X p and 2 X (1 — p) is greater than 
1 — p. That is, confessing is rational for me. 

I also realize that the situation is symmetrical between him and 
me; given that he is a rational agent with a perfect power of reason- 


* One may reason that whether or not he is assumed to confess I shall be better 
off if I confess This reasoning takes into account the two limiting cases when is 


respectively 0 and 1 
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ing, it is also clear to me that he, too, will choose rationally to 
confess. Consequently, my reasoning continues, I am doomed to live 
5 years in jail; that is, in this situation the measure of the expected 
years in prison is 5 when everything, including his rational action, is 
taken into account. Thus, I naturally want to change the situation so 
as to make him not to confess; and it is the same with him. But, alas, 
that option is not available to us; for we cannot coordinate our 
actions by way of, say, communication. 

Stage II. Suppose now that the situation has slightly changed; that 
is, we are allowed to communicate once. In the meeting I initially try 
to dissuade him from confessing, but I shortly find out that I can 
dissuade him only if I also promise not to confess and vice versa. 
Consequently, we make an agreement not to confess. Assuming, 
following Gauthier (MA, 156), that the bargaining is cost-free, it 
seems rational to make such an agreement; for it renders my ex- 
pected utility at least as high as otherwise.’ In fact, the actual discus- 
sion might not have been essential. Suppose that, fortunately, we 
previously read Gauthier’s book together and that, therefore, we 
were convinced of, say, Gauthier’s principle of “minimax relative 
concession,” which specifies terms of a fair and mutually advan- 
tageous cooperation.’ Furthermore, each of us knows that the other 
has the same understanding. Then I can conclude that we would 
have agreed not to confess were there a chance to discuss. And I, 
knowing his acumen, can be certain that he should be reasoning in 
the same way. In sum, it is exactly as if we made an actual agree- 
ment; so we might rely on the hypothetical agreement. 

Does the agreement, hypothetical or actual, make a substantial 
difference to the situation? Here Hobbes’s Foole? enters into the 
stage, arguing that it makes no essential difference to the situation. 
Given the maximizing conception, the Foole argues, the agreement 
falls short of introducing an intrinsic reason to comply with the 


7 For Gauthser’s account of the principle of mimimax relative concession, see 
MA, 141-5. In this context, however, the specific content of the principle is not 
crucial. What is important is that we both recognize the optimal outcome as the 
one based on a/the fair and mutually advantageous prinaple. 

8 See Hobbes, Leviathan, C. B. Macpherson, ed. iNew ork: Penguin, 1968), 
ch. 15, p. 203: “The Foole ... does not therein deny, that there be Covenants; 

but be questionth, whether Injustice ... may not sometimes stand with that 
Reason, which dictateth to every man his own good ” 
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agreement. Consequently, he continues, the agreement is not suffi- 
cient to secure the cooperational benefits to us, although the agree- 
ment seems rational to make and is necessary for cooperation; for I 
still have a choice between complying with and deserting the agree- 
ment. Here is the table of possible outcomes: 


He 
(I, he) Not Comply 
I Not (5, 5) (1, 10) 
Comply (10, 1) (2, 2) 


Table 2 


It shows that I am facing the same situation as the original dilemma; 
so I find it rational not to comply with the agreement, i.e., rational 
to confess. The agreement turns out to be of no avail, although it 
initially seems rational to make. The Foole, as Gauthier notes, is 
challenging ‘‘what would seem to be the ordinary view that given 
neither unforseen circumstances nor misrepresentation of terms, it is 
rational to comply with an agreement if it is rational to make it” 
(MA, 165; my emphasis). 

Stage III. My dilemma persisting in stage II lies in the fact that I 
did not succeed in making him not confess even by reaching an 
agreement with him; consequently, the agreement fails to upgrade 
my expectation. What is needed is not making an agreement but 
ensuring his compliance. 

Suppose now that we are granted another chance to meet. Before 
the meeting I am considering how to make it certain that he will 
comply. To be sure, the best result of the meeting would be to 
ensure his unilateral compliance regardless of my action, but this is 
simply impossible. So I look for a way of achieving the outcome that 
is the second best result for me: making it certain that he will comply 
by way of ensuring him of my own compliance. Otherwise, I have no 
way to shorten 5 years in jail. The way to ascertain my compliance is 
to form and make him check, with absolute certainty, my disposition 
to constrained maximization, i.e., the disposition to comply with an 
agreement based on the fair and mutually advantageous principle in 
case I am certain that he will also comply with it. He reaches the 


° This is a simplified version of the disposition to constrained maximization, the 
version which I think is adequate for the mmplified situation at issue here. Cf. 
MA, 167f. 
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same conclusion on his own, and both of us succeed in forming the 
requisite disposition. In the meeting we “see” each other’s disposi- 
tion, and reach again or confirm the agreement not to confess. 
Now I can reliably expect to gain the benefit of 2 years in jail. By 
forming the disposition, I have succeeded in lowering the ultimate 
measure, Y, of expected years in jail to 2. 
Hn 


In stage II, the dilemma facing the prisoners is given a practical 
solution; they succeed in reaping the benefit of cooperation through 
an agreement backed up by the disposition to constrained maximi- 
zation. And it is rational in the maximizing sense to be so disposed in 
such a situation. Yet, I argue, the problem of justifying the rational- 
ity of acting as the disposition leads one to is far from being solved. 
In my analysis of the solution of the dilemma in stage III, it is not 
necessary at all to assume that it is rational for him or “me” to act 
in accordance with the disposition to constrained maximization; 
there it is sufficient to observe that the prisoners (he and “T”’) are 
now reliably expected to act in accordance with the disposition— 
that is, not to confess—because the disposition is firmly entrenched. 
In their (“our”) own deliberation, too, their expectations about the 
other’s action are entirely based on the available information con- 
cerning the other’s dispositions. Although it is argued that it is 
rational to form the disposition to constrained maximization, no 
appeal needs to be made to the rationality of acting in accordance 
with the disposition. In sum, the rationality of internalizing a fair 
and mutually advantageous principle of cooperation is not yet 
shown to imply that it is rational to act in accordance with it. 
Thus, Hobbes’s Foole, the staunchest promulgator of the maxi- 
mizing conception (in its standard interpretation), will come back to 
the stage and continue to argue against Gauthier’s defense of moral- 
ity as follows. Consider again, the Foole argues, the situation where 
we have found each other’s disposition to constrained maximization 
with absolute certainty. I, the Foole, grant that it was rational to be 
so disposed. But the question remains whether it is rational to act in 
accordance with the disposition—to be sure, granting that the 
agreement is fair’? and mutually advantageous, and that neither un- 
forseen circumstances nor misrepresentation of terms is found, and 
that “the rationale for [the disposition] does not appeal to any 
weakness or imperfection in [our] reasoning” (MA, 186).™ 


10T understand this as including fairness of the initial position of the agree- 
ment, which Gauthier argues is ensured by satisfaction of what he calls "the 
P oviso.” 
11 All these assumptions are satisfied by description of the situation itself. 
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There are indeed two kinds of differences between the present 
situation (i.e., stage ITI) and the dilemma in the stage IT. On the one 
hand, there are differences based on the fact that I am myself dis- 
posed to constrained maximization; on the other hand, those deriv- 
ing from the fact that I now have different expectations about the 
other’s action because I know that he has a firm disposition to con- 
strained maximization. The first kind of differences, I argue, do not 
count when the rationality of the conforming action is at issue. The 
rationale for this consists in the distinction between questions con- 
cerning rationality and those concerning motivational strength. I, 
the Foole, concede that by forming the disposition I made myself 
strongly disposed not to confess in the present situation. Conse- 
quently, I know that I will not confess. But this is a far cry from 
saying it is rational to act so. 

Imagine that I am reflecting on the situation several years later 
after not confessing: I came out of the jail after 2 years with compla- 
cence in the successful cooperation. At this point, there is no longer 
any doubt about which motivation was the strongest when I was 
deciding whether or not to confess. But the question still remains 
whether it was rational (by the standard of that time) to act in accor- 

“dance with the disposition, and the answer is not automatically affir- 
mative. We often meet with another kind of situation in which the 
distinction is typically made between questions concerning rational- 
ity and those concerning motivational strength. The fact that I failed 
to act in a certain way is not automatically regarded as implying that 
it was irrational for me to act in that way; the discrepancy is often 
explained with an appeal to what is called “weakness of the will.” If 
this is a real phenomenon, why should motivational strength of a 
certain disposition directly imply the rationality of acting in accor- 
dance with it, even granting that the disposition itself was formed on 
a rational basis? 

Then what is rational for me to do in stage III? I, the Foole, again 
suggest a simple calculation of the expected utilities. Look at table 3 
below: 


He 
(I, he) Not Comforn 
I Not (5, 5) (1, 10) 
Conform to the disposition (10, 1) (2, 2) 


Table 3 
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Table 3 is exactly like table 2, but in calculating expected utilities I 
may also take it into account that I now expect him to act in confor- 
mity with the disposition. Consequently, only the second column in 
table 3 really matters. Here are the measures, Y, of the expected 
years in jail, depending on whether I also act in conformity with the 


disposition or not: 


Y (Not conforming) = 1 
Y (Conforming) =2 


This shows that not conforming, i.e., confessing, is rational in spite 
of my entrenched disposition not to confess. 

To emphasize again, this does not mean that I shall not act in 
conformity with the disposition. Rather, I take this as a situation in 
which I absolutely feel like not confessing, although I am aware that 
it is not rational. That is exactly what he, the other party, wanted to 
make certain; it did not concern him at all whether I would find it 
rational to conform to the disposition once it was made absolutely 
certain that I would conform. 

m 
I presented the Foole’s argument as consisting of two parts. In the - 
first part, he argues that the following principle is yet to be justified: 


1. Connecting principle: (given that typical provisos’? are met) if it 
is rational to be disposed to constrained maximization, then it is 
rational to act in conformity with the disposition. 


The latter part consists in persistently applying the maximizing con- 
ception of rationality to the action of conforming to the disposition, 
which leads to the conclusion that confessing is still rational. To be 
sure, Gauthier agrees with neither of these. I shall begin by discuss- 
ing his disagreement with the first point. The discussion will eventu- 
ally lead to his rejection of the second point. 


ness or mperfection in the .” Note that the Foole’s argument for the 
first point takes no advantage of imperfections. On the other hand, if he 
means by a actor a person who chooses to do something if and only if it is 
rational to do so, then he begs the question that the ect actor will 


the “straightforward maximizers” are perfect actors, given the maximizing con- 
tion. Yet the issue is whether they remain to be so when them- 
selves into “constrained maximizers. This is not ensured by the rationality of the 


metamorphosing actions The Foole’s straightforward answer, to be sure, is in the 
negative. 
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He affirms that “if her [the perfect actor’s] dispositions to choose 
are rational, then surely her choices are also rational” (MA, 186).13 
By this he seems to endorse principle (1). But the view itself needs to 
be justified, and the justification should be based on (some interpre- 
tation of) the maximizing conception of rationality; otherwise, he 
has yet to make his case that he is giving “a utility-maximizing ra- 
tionale” (MA, 158) for compliance. Principle (1) might be granted 
to be part of our common conception of rationality, but that is 
another matter. 

Imagine that principle (1) might be justified by the maximizing 
conception of rationality as it is formulated in the beginning of this 
paper:'* 


2. Action-maximizing prinaple: an act is rational if and only if it is 
expected to maximize the agent’s own interest. 

Or, suppose that it is justified as a kind of conceptual truth linking 
the concept of the rationality of a disposition to that of the rational- 
ity of conforming action or that it is justified, independently of 
principle (2), as reflecting a fixed point in our intuitions about the 
concept of rationality. Then it is rather worse than better for princi- 
ple (2) because it results in a reductio of (2): by utilizing Gauthier’s 
argument for the rationality of being disposed to constrained maxi- 
mization and the implication of the principle (2) drawn by the Foole, 
one can show that, given relevant assumptions built into the descrip- 
tion of the prisoner’s dilemma, principle (2) together with principle 
(1) leads to a contradiction. Let me explain. 

Here is the conclusion of Gauthier’s argument, which I grant: in 
the prisoner’s dilemma, 


3. It is rational to be disposed to constrained maximization. 
(1) and (3) imply, as is intended, that 


4 It is rational to act in conformity with the disposition to constrained 
maximization. 


Now, apply principle (2) to the stage III quasi dilemma. Then, as the 
Foole is apt to show, it follows that: 


5. It is rational not to act in conformity with the disposition to con- 
strained maximization. 


and that: 
6. It is not rational to act in conformity with the disposition. 





14 It was based on Gauthier’s formulation in MA, ch. 1. Cf. fn. 2. 
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which is due to the ‘‘only if” part of principle (2). (6) contradicts (4). 
The prisoner’s dilemma, it turns out, helps to elucidate that princi- 
ples (1) and (2), each of which taken independently seems adequate, 
lead to a contradiction when applied to the realistic situation 
marked by “conflict as well as identity of interests.” 

Gauthier may be anticipating this problem when he says that he is 
working with a different interpretation of the maximizing concep- 
tion, which “identifies practical rationality with utility-maximization 
at the level of dispositions to choose.” And he “carries through the 
implication of that identification in assessing the rationality of partic- 
ular choices” (MA, 187). Thus, while rejecting (2) he might be coun- 
tenancing the following principle: 


7. Prmaple of disposition maximizing: a disposition is rational for a 
person p to have if and only if it is expected to maximize p's own 
interest. 


In addition, he certainly needs principle (1), whatever its justifica- 
tion may be, to show the rationality of acting in conformity with the 
disposition to constrained maximization. 

It is quite obvious that principle (1) should be independent of the 
disposition-maximizing principle since this has nothing whatsoever 
to say about the rationality of particular actions. Gauthier might 
continue that it is so much the better for his disposition-maximizing 
conception of rationality. Principle (1) may be simply added as an 
independent principle to the conception. One might motivate the 
move in the following way: we begin by reflecting on the maximizing 
conception in its inarticulate form, and arrive at a correct formula- 
tion of it; and then we can appropriately extend that conception to 
reach a coherent and comprehensive concept of rationality. Now 
think about the present situation: the Foole’s persistent argument 
makes it clear that principle (1) needs to be incorporated into the 
concept of rationality to launch Gauthier’s justification of morality, 
and it turns out that adding it to the disposition-maximizing princi- 
ple can have no bad consequences; then, it is rather mandatory to 
add it (or some other principle that implies it). Or so he might 
argue. This might exemplify a respectable methodology, but it is still 
worth noting that he is, in effect, implicitly working with at least two 
independent principles of practical rationality to show the rational- 
ity of morality; he no longer undertakes to derive the one from 
the other. 

Finally, insofar as he wants to endorse the “maximizing concep- 
tion” in its correct interpretation, he will certainly agree to adding 
the following principle: 
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8. An action is rational if it is expected to maximize the agent’s inter- 
est, on the condition that it does not conflict with an action that is 
ruled to be rational by virtue of principles (1) and (7). 


To prepare for the concluding remarks, let me present the ultimate 
conception of rationality, which I have just reached, as consisting of 
the following principles: 


I. A disposition is rational for p to have if and only if it is expected to 
maximize p's own interest. 
Il. If it 1s rational to be disposed to constrained maximization, then it 
is rational to act in conformity with the disposition. 
III. An action is rational if it is expected to maximize the agent’s 
interest. 
IV. In case of conflict, rulings based on principle (II) together with 
principle (I) take precedence over those based on principle (III). 


Iv 
I am now in a position to look back to assess how far Gauthier 
should have come away from the naive understanding of the maxi- 
mizing conception of rationality in the attempt to justify the rational- 
ity of following the commands of morality. Let me quote Gauthier: 


But Hobbes . . . must revise his conception of rationality, breaking the 
direct connection between reason and benefit with which he began to 
reply. Hobbes needs to say that if ıs rational to perform one’s covenant 
even when performance is not directly to one’s benefit, provided that ut 
is to one’s benefit to be disposed to perform. But this he never says. And as 
long as the Foole is allowed to relate reason directly to benefit in perfor- 
mance, rather than benefit in the dispositions to perform, he can escape 
the refutation (MA, 162; my emphasis). 


This more than adequately makes my point: when one introduces 
the principle of disposition maximizing while rejecting the principle 
of action maximizing, one should be breaking “the direct connec- 
tion between reason and interest.” I made a further point that may 
have also been implicit in Gauthier’s discussions: rejection of the 
principle of action maximizing in its full scope requires the intro- 
duction of the principle governing the connection between the 
rationality of dispositions and the rationality of actions as an inde- 
pendent principle; the latter can not be a consequence of the core 
of the maximizing conception. 

Suppose now that adding principle (1) is acceptable to Gauthier. 
Then the previous point about the connection between reason and 
individual interest needs an elucidation; for it is not merely that the 
direct connection from individual interest to rationality is not avail- 
able in an adequate conception of rationality. Rather, the correct 
picture may be that a constraint to “‘the direct pursuit of individual 
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interest” is found to be embedded in the concept of rationality 
itself, however tenuous its force may appear before serious deli- 
beration. The constraint lies in restricting the application of the 
(weakened) action-maximizing principle (II) in terms of the con- 
necting principle (IJ) and, perhaps more explicitly, the precedence 
principle (IV). 

Lf this is a correct analysis, I may come to another elucidation of 
the nature of Gauthier’s moral contractarianism. Again, let me be- 
gin by quoting some passages in MA, chapter 1, where Gauthier 
presents an overview of his theory: 


Morals by agreement begin from an initial presumption against moral- 
ity, as a constraint on each person’s pursuit of her own interest (8f). 
Let us suppose it agreed that there is a connection between reason and 
interest (8). 

Let it be further agreed that in so far as the interests of others are not 
affected, a person acts rationally if and only if she seeks her greatest 
interest or benefit (6f). 


These statements indicate that Gauthier believes that a constraint on 
the direct pursuit of individual interest, which is absent in the concept 
of rationality in itself, can be derived when the direct pursuits need 
to be compromised in view of the complexity of the situation in which 
we are put. I grant that I might be wrong in diagnosing Gauthier’s 
own view of his project; for instance, the third passage might have 
been introduced just to explain the action-maximizing conception 
of rationality which Gauthier rejects in due course. Still, it may be 
worth noting that the project as I possibly mistakenly understand it 
does not necessarily conform to what is actually going on in his 
contractarian justification of morality. What the contractarian 
moral theory shows may be that what initially does not look like 
much of a constraint turns out to be substantial and powerful when 
applied to so complex a situation as we are put in, a situation in 
which identity and conflict of interests are hard to sort out. If this 
understanding is correct, I doubt it can be significantly claimed that 
“[m]orals by agreement begin from an initial presumption against 
morality” (MA, 8f). 
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NOTES AND NEWS 


The Editors wish to remind our correspondents that notices of scheduled 
meetings and symposia are of value to our readers only if they appear earty 
enough to make attendance possible. Copy for Notes and News must be 
planned and processed in this office in time for it to go to the printer at 
least three months before publication date, the 1st of each month. Accord- 
ingly, an announcement of a meeting to be held, say, in April, should be 
scheduled for our January issue (or sooner) and should be received in this 
office by September. The Editors appreciate notices that come to us ready 
for the printer, requiring little editorial work, in JOURNAL style (see these 
columns), with no imperatives, no prices, no telephone numbers, no unnec- 
essary capitals; and reasonably terse since space is limited. Notes and News is 
included in the JOURNAL when space is available, and on a first come, first 
served basis; so we cannot guarantee publication of an announcement even 
if it is received with all the above conditions met. 


Analysis is pleased to announce its second Essay Competition. Papers may 
be on any topic broadly in keeping with the traditional concerns of the 
journal. An upper limit of 4000 words, including footnotes but excluding 
any end list of references, will be strictly applied. Entries (two copies, pre- 
pared for blind refereeing, accompanied by a word count, and clearly 
marked ‘‘Competition’’) must arrive no later than March 31, 1993. Authors 
should be aged 80 or under on that date; but entrants with interrupted 
academic careers, who believe that there are grounds for relaxing the age 
Tule in their case, may consult the Editor. The winning essays, with a selec- 
tion of other entries, will be published together as a supplement to the 
October 1993 issue of Analysis. Entries should be sent to Analysis, Peter 
Smith, Editor, Phil. Dept., Univ. of Sheffield, Sheffield, S10 2TN, U.K. 


Garland Press is pleased to announce that work has begun on The Freud 
Encyclopedia: Theory, Therapy, and Culture. The Editors of the volume will 
be Edward Erwin, Michael Moskowitz, and James Walkup. Among the advi- 
sors are Adolf Griinbaum, Philip Holzman, and Theodore Shapiro. The 
Editors’ goal is to present unbiased coverage, in alphabetical entries solic- 
ited from scholars around the world, of all aspects of Freud’s life, theories, 
and influence. Publication is scheduled for spring 1994. Inquiries should be 
directed to Prof. Edward Erwin, Editor, Phil. Dept., Univ. of Miami, Coral 
Gables, FL 33124. 

Garland Press is also pleased to announce that work has begun on The 
Encyclopedia of Aesthetics. The Editor will be Michael Kelly of The Journal of 
Philosophy, Cohumbia University. Among the advisors are Arthur C. Danto, 
Mary Mothersill, Richard Kuhns, Marx Wartofsky, Joseph Margolis. The 
Editor’s goal is to cover key philosophical topics in the history of aesthetics, 
including the most recent debates. Publication is expected in 1995. Inqui- 
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ries should be directed to Michael Kelly, Phil. Dept., 708 Philosophy Hall, 
Columbia Univ., New York, NY 10027. 


Harvard University, The Program in Ethics and the Professions, invites 
applications for Fellowships in Ethics. Six Fellowships will be awarded in 
1993-4 to outstanding teachers and scholars who wish to develop their 
competence to address ethical issues in faculties of business, government, 
law, medicine, public policy, and social science. Fellows will participate in 
the weekly seminar of the Program, attend courses in one of the profes- 
sional schools or in the Graduate School of Arts and Sciences, and conduct 
their own research on ethics. The Fellowship usually extends from Sep- 
tember through July. Applicants usually hold a postgraduate degree in busi- 
ness, government/public policy, law, or medicine, or a doctorate in philo- 
sophy, political theory, or theology. Application deadline is January 6, 
1993. For further information, contact Jean McVeigh or Helen Hawkins, 
The Program in Ethics and the Professions, Harvard Univ., 79 Kennedy St., 
Cambridge, MA 02138. 


The Hoover Institution, with support from the National Endowment for 
the Humanities, has recently completed a project for the preservation mi- 
crofilming’ of the Sir Karl Popper Papers, a large personal archive of over 
340,000 documents spanning the whole of Popper’s career. Acquired by 
the Institution in 1986, these materials include holograph and typescript 
drafts of Popper's published and unpublished writings; outlines and tran- 
scripts of speeches and lectures; notes assembled for writing projects (many 
of which are uncompleted); and letters directly related to publications, pro- 
jected publications, and speeches. Printed copies of Popper’s published 
articles are included in many cases; printed books are not included. The 
speeches and writings series is arranged chronologically by date of composi- 
tion or publication, beginning with the original typescript of Popper’s 1928 
doctoral dissertation and ending with an unpublished 1986 holograph on 
probability theory. Also important is the correspondence series with 
hundreds of physicists, biologists, mathematical logicians, philosophers, so- 
cial scientists, and classicists. For additional information, please contact the 
Hoover Institution Archives, Stanford, CA 94305-6010. 


The Stanford Humanities Center will offer up to six external fellowships 
for 19934 in the following categories: (1) senior fellowships for well- 
established scholars; (2) junior fellowships for scholars who at the time of 
application are at least three years beyond receipt of the Ph.D. and nor- 
mally no more than ten. For 1992-3, junior fellows were offered stipends 
of up to $20,000 and senior fellows stipends of up to $30,000. We antici- 
pate offering stipends in a comparable range for 1993-4. The deadline 
for applications is November 16, 1992. Application materials and further 
information may be obtained by writing to the Fellowship Program, Stan- 
ford Humanities Center, Mariposa House, Stanford Univ., Stanford CA 
94305-86380. 
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The North American Association for the Study of Jean-Jacques Rousseau 
invites papers for Rousseau and Criticism/Rousseau et la Critique, an inter- 
disciplinary conference to be held at Trent University, Ontario, Canada, 
May 27-30, 1993. We especially welcome papers on Rousseau and Roman- 
ticism, Rousseau and contemporary feminism, and Rousseau’s influence on 
contemporary literary theories. One- or two-page abstracts should be sent 
by December 1 to Jim MacAdam, Phil. Dept., Champlain College, Trent 
Univ., Peterborough, Ontario, Canada K9J 7B8. 


The Department of Philosophy, Logic and Scientific Method at the London 
School of Economics and Political Science has established a Lakatos Scholar- 
ship for Postgraduate Study in Philosophy in memory of Professor Imre 
Lakatos who taught in the Department up to the time of his death in 1974. 
Candidates for the two-year taught M.Phil. Degree in Philosophy, and for 
the M.Phil/Ph.D. Philosophy research degree are eligible to apply. The ` 
Scholarship is tenable for two years, although the second year is conditional 
on satisfactory academic progress in the first year. Applications for the 
Scholarship for students entering the 1993/4 session should be submitted 
to: the Secretary, Phil. Dept., Logic & Scientific Method, The London 
School of Economics, Houghton St., London WC2 2AF. Closing date: Jan- 
uary 31, 1993. Applications for admission to the above degrees should be 
made to the graduate school in the usual way. 


The international quarterly Philosophical Inquiry on the completion of its 
fifteen years of continuous publication has been reorganized by introduc- 
ing new sections of philosophical interest, increasing its pages and changing 
its cover. It also plans to include a series of special issues on general philo- 
sophical themes as well as special monographs. Scholars and others con- 
cerned may submit their papers for the following special issues: Presocra- 
tics, Sophists, Socrates, Plato, Aristotle, Stoics, Skeptics, Epicureans, Plo- 
tinus, Modern scepticism, Causality, Art and Technology, Aesthetic 
Criteria, Philosophy of Language, and Chinese Philosophy. Dr. St. Virbida- 
kis, Aristotelian University of Thessaloniki, will function as Assistant Editor. 
Editor remains Professor D. Z. Andriopoulos, Phil. Dept., Aristotelian 
Univ. of Thessaloniki, Greece. 


Loyola University of Chicago and Loyola University Press have scheduled 
publication of Volumes IV through VII in their joint Values and Ethics 
Series. For 1992-3 their publication schedule includes works on: the narra- 
tive theology of St. Teresa of Avila; abortion based on prenatal diagnosis; 
antinomies of social theory; and the burdens of distributive justice. Inter- 
ested authors in humanities, history, philosophy, theology, law, education, 
political science, sociology, medicine and the helping professions, etc. 
should send letters of inquiry, CV, an outline of the manuscript, Table of 
Contents, etc. to: Dr. Roseanne Perez-Woods, Chairperson, Editorial 
Board, Values and Ethics Series, Loyola Univ. of Chicago Medical Center, 
2160 South First Avenue, Bldg. 131-N, Maywood, IL 60153. 
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BELIEF ASCRIPTION* 


shall do four things in this paper. First, I shall propose a cer- 

tain theory of the semantics of belief ascriptions as being the 

best theory of their semantics relative to a certain assumption. 
Second, I shall raise three problems for this best theory. Third, I 
shall make what I hope is an interesting connection between the 
main issue addressed and the vexing question about the form that a 
meaning theory for a particular language must take. And, fourth, 
I shall close with a word about the theoretical situation thus 
determined. 

The assumption in question is the widely accepted one that each 
natural language has a correct compositional truth theory. A com- 
positional truth theory for a particular language L is a finitely axio- 
matizable theory of L that issues in a theorem of the form 


An utterance of ‘S is true in L iff such and such. 


for each of the infinitely many truth-evaluable sentences of L. For 
example, a truth theory for English might contain a theorem like 


An utterance of ‘She wrote it’ is true in English iff the female referred 
to by the utterance of ‘she’ wrote the thing referred to by the utterance 
of ‘it’. 


If we assume that English has a correct compositional truth 
theory, then we may ask how a particular class of sentences is to be 
accommodated in such a theory. This is to ask about the kinds of 


A areena indebted to the Spal etalon i gar on Oee npa 
in 1991 at the CUNY Graduate ter. Earlier versions of this paper were read in 
one ee at the Univermty of Wisconsin/Madison, the University of Chicago, 
and SOFIA conference in Salamanca, Spain. I benefited considerably from 
the discussions at these talks, and especially from the comments of Ned Block, 


James Higginbotham, Richard Jeffrey, Jaegwon Kim, Ernest LePore, Richard 
Men and Ernest Sosa. 
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semantic values that would have to be assigned to the components of 
those sentences and the kinds of compositional rules that would 
need to apply to those values in order to crank out a truth condition 
for each sentence of the class. Roughly speaking, this question of 
accommodation within a compositional truth theory is what philoso- 
phers are concerned with when they ask about the logical form of a 
class of sentences. And this is my cue to restate the first of the things 
I shall do—namely, to offer what I take to be the best account of the 
logical form of belief ascriptions. In other words, I shall try to say 
how belief ascriptions would have to be accommodated in a compo- 


sitional truth theory on the assumption that English has one. 
I. THE HIDDEN-INDEXICAL THEORY BALDLY STATED 


If English has a compositional truth theory, how should belief sen- 
tences be accommodated in it? What, that is to ask, is the best theory 
of the logical form of belief ascriptions on the assumption that they 
have a logical form? The theory I shall propose in answer to this 
question is one I shall call the hidden-tndexical theory. It is a theory 
whose essential idea must have occurred to almost anyone who has 
thought seriously about the semantics of belief sentences. I first 
discussed a version of it in “Naming and Knowing”;! I discuss it 
briefly in the form I shall presently give it in “The ‘Fido’-Fido 
Theory of Belief’;* and a version of the theory is nicely presented 
and defended in Mark Crimmins’s and John Perry’s “The Prince 
and the Phone Booth: Reporting Puzzling Beliefs.’”* 

The theory, in my favored version, begins with the claim that 
believing—the relation expressed by ‘believes’ in a sentence of the 
form ‘x believes that S’—is a three-place relation, B(x, p, m), holding 
among a believer x, a certain sort of structured proposition p, and a 
mode of presentation m under which x believes p. The idea is that x 
may believe p under one mode of presentation m, disbelieve p under 
a second mode of presentation m, and suspend judgment altogether 
under a third mode of presentation m”. Actually, I should have said 
that believing is a four-place relation, where the fourth place is a 
time at which x believes p under m, but I shall suppress the temporal 
reference for simplicity. Let me now explain the notions of a struc- 
tured proposition and a mode of presentation before moving on to 
the essence of the theory, its account of the semantics of belief 
reports. 

The sort of proposition I have in mind is a structured entity whose 
basic components always include at least one property or relation, 


! Midwest Studies tn ae u (1977): 28-41. 
? Philosophical Perspecirues, 1 (1987): 455-80. 
? This JOURNAL, LXXXVI, 12 (December 1989): 685—711. 
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and may include any sort of contingently existing object. The sim- 
plest example of such a structured proposition would be a so-called 
singular proposition, such as the ordered pair (Fido, doghood). This 
proposition is true just in case Fido instantiates doghood, the prop- 
erty of being a dog; that is to say, just in case Fido is a dog. Interest- 
ing technical questions arise about the representation of more com- 
plex propositions, but singular propositions will serve present pur- 
poses well enough. All that really matters to the hidden-indexical 
theory is that the propositions we believe are structured entities 
composed out of the objects and properties we are pretheoretically 
taken to be talking about when we express our beliefs. I shall call 
such propositions Kaplan propositions, in honor of David Kaplan, 
who did so much to popularize them.* 

This brings us to modes of presentation. A mode of presentation 
of a proposition is determined by (a) the modes of presentation of 
the objects and properties contained in the proposition and (b) the 
place of those things in the proposition. As a first approximation, 
you might think of a mode of presentation of a proposition as an 
n-tuple of modes of presentation of the objects and properties con- 
tained in the proposition. 

The expression ‘mode of presentation’, as it was introduced by 
Gottlob Frege and as it is used today, is a technical notion, and to 
understand it is to appreciate the need for its introduction. Two 
little stories should help remind us of that need." 

The first is the infamous case of the morning dog and the evening 
dog. There is a certain dog who begs at Ralph’s door every morning. 
Ralph feeds this dog, whom he has named ‘Fido’, and has grown 
attached to it. Ralph believes that Fido is male. There is also a cer- 
tain dog who begs at Ralph’s door every evening. Ralph feeds this 


‘Let a smgular proposition be any ordered pair ((x, ..., x,), ®,), where 
(x, ..., Xa) 1s an n-ary sequence of items and $* is an n-ary property (b 
are customarily dropped for one-membered sequences). Such a proposition is 
true just in case its n-ary sequence mstantiates its n-ary property. way to 
with Kaplan propositions is to treat them all as singular propositions Thus, 
proposition that Fido 1s a dog and Preface is a cat might be 

(((Fido, doghood), (Pieface, cathood)), CONJ} 
where CONJ ıs that relation between two propositions that obtains iff both are 
true, and the proposiuon that some dogs bark might to a first approximation be 
represented as 


(the property of being a dog that barks, SOME) 
where SOME 1s that property that a property has iff it ıs instantiated And so on 
5 The next couple of paragraphs derive from my ‘The Mode-of-Presentation 


Problem,” in C. Anderson and J. Owens, eds , Propositional Attitudes (Stanford: 
CSLI, 1990). 
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dog, too, whom he has named ‘Fi Fi’, and has also grown quite 
attached to it. Ralph believes that Fi Fi is female, and he thinks that 
Fido and Fi Fi would make a really cute couple. Unbeknown to 
Ralph, Fido is Fi Fi. 

The second story is that of the spurious natural kind shmoghood. 
Ralph came upon a race of creatures that he thought comprised a 
previously unencountered biological species, and he introduced the 
word ‘shmog’ to designate members of that species. “A thing shall 
be called a ‘shmog’,” Ralph said, “just in case it belongs to the 
species of these creatures.” Unbeknown to him, however, shmog- 
hood is doghood; Ralph had stumbled not upon a new species but 
upon a new race of dogs, and thus the property that ‘shmog’ has 
been introduced as standing for is none other than doghood. 

Now, when Ralph says “‘Fido is male but Fi Fi is female,” he is not 
being irrational in holding the belief which that utterance expresses. 
But he could not rationally hold the belief that would be expressed 
by an utterance of ‘Fido is and is not male’. Likewise, Ralph is not 
irrational in holding the two beliefs expressed by a morning utter- 
ance of “This dog is male’ and an evening utterance of ‘This dog is 
female’, even though the same dog is referred to in both utterances. 
But Ralph could not rationally hold the belief that would be ex- 
pressed by a sincere utterance of “This dog is and isn’t male’. 

Similarly, when Ralph says “Fido is a dog, not a shmog,” and 
when he says “No dogs are shmogs,” he is not being irrational in 
holding the beliefs those utterances express. But he would be irration- 
al were he to hold the belief that would be expressed by an utterance 
of ‘No dogs are dogs’. 

Appeal to the notion of a mode of presentation is designed pre- 
cisely to accommodate these sorts of data. The intuitive idea, defini- 
tive of the notion of a mode of presentation, may be called Frege’s 
constraint. Stated informally and in a way that prescinds from the 
details of any particular theorist’s account of belief or of belief- 
ascribing sentences, Frege’s constraint has two parts. First it says 
that a rational person x may both believe and disbelieve that a cer- 
tain thing or property y is such and such only if there are distinct 
modes of presentation m and n such that x believes y to be such and 
such under m and disbelieves it to be such and such under m. Then 
it says that there are distinct modes of presentation m and ny such 
that rational person x believes y to be such and such under m and 
disbelieves y to be such and such under m only if x fails to realize 
that m and m are modes of presentation of one and the same thing. 
In other words, you cannot rationally believe and disbelieve some- 
thing under one and the same mode of presentation, or under 
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modes of presentation that you realize are modes of presentation of 
the same thing. The notion of a mode of presentation is functionally 
defined by Frege’s constraint in that something is a mode of presen- 
tation if it plays the role defined by Frege’s constraint, and nothing 
can be a mode of presentation unless it plays that role. To ask what 
modes of presentation are is just to ask what things play that role. 
Presently I shall raise the question of what modes of presentation 
might be, but the hidden-indexical theory I shall now describe is 30 
far neutral on the outcome of that question. 


As applied to the paradigmatic example 
{1] Ralph believes that Fido is a dog. 


the hidden-indexical theory says that the logical form of an utter- 
ance of this sentence may be represented as 


[2] Gm) (@*m & B(Ralph, (Fido, doghood), m)) 


where &* is an implicitly referred to and contextually determined 
type of mode of presentation. By a type of mode of presentation I 
mean merely a property of modes of presentation; $*, for example, 
might be that property that a propositional mode of presentation 
has when and only when it requires thinking of Fido as being the 
dog who appears in the morning and requires thinking of doghood 
as a property shared by such-and-such similar-looking creatures. 
The reference to a type of mode of presentation is implicit in that, 
although the sentence requires the speaker to be referring to a type 
of mode of presentation whenever the sentence is uttered, there is 
no word in [1] which refers to that type (whence the ‘hidden’ in 
‘hidden-indexical theory’). The reference to the type of mode of 
presentation is “contextually determined” in that different types 
may be referred to on different occasions of utterance (whence the 
‘indexical’ in ‘hidden-indexical theory’). 

As we shall see when I stop baldly stating the theory and begin 
defending it, the reference is to a type of mode of presentation 
rather than to some particular mode of presentation because we 
need not be in a position to refer to an actual mode of presentation 
under which someone believes a proposition. But in a limiting case 
reference may be to an actual mode of presentation, for the “type” 
referred to may be one that uniquely picks out a particular mode of 
presentation, or it may simply be the property of being identical to 
such-and-such mode of presentation. So at one end of the spectrum, 
the type referred to may constitute a reference to a unique mode of 
presentation; at the other end, the type might be vacuous, as when 
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the speaker’s import in uttering [1] is merely that Ralph believes the 
proposition that Fido is a dog under some mode of presentation or 
other. In between, we have reference to substantial types that do not 
determine unique modes of presentation. 

So the representation of [1] as [2] tells us quite a bit. It tells us 
that a correct compositional truth theory for English must construe. 
‘believes’ as a three-place relational predicate holding among be- 
lievers, Kaplan propositions, and modes of presentation of those 
propositions. It tells us that [1]’s ‘that’-clause, ‘that Fido is a dog’, is 
a referential singular term whose referent is the singular proposi- 
tion (Fido, doghood), and it tells us that the references of ‘Fido’ 
and ‘dog’ in that ‘that’-clause are Fido and doghood, respectively. It 
tells us that an utterance of the sentence requires reference to a type 
of mode of presentation, so that an utterance of [1] is true just in 
case [2], where ®* is the mode-of-presentation type referred to in 
the utterance. And it tells us that this contextually determined refer- 
ence to a type of mode of presentation is by a “hidden indexical”’ in 
that there is no actual indexical in [1] which carries this reference. 
In this respect, belief ascriptions are like other sentences containing 
hidden indexicals. For example, in uttering ‘It’s raining’, the 
speaker must be referring to some place at which it is raining (typi- 
cally, this is the speaker’s location, but it need not be: a speaker in 
Chicago may reply ‘It’s raining’ when asked about the weather in 
New York). 

I REMARKS IN SUPPORT OF THE HIDDEN-INDEXICAL THEORY 


According to the hidden-indexical theory, an utterance of the sen- 
tence ‘A believes that S” is true just in case the utterer is referring to 
a type of mode of presentation ® and the referent of ‘A’ believes the 
Kaplan proposition referred to by ‘that S under some mode of 
presentation of the type ®. My claim is that this theory provides the 
best account of how belief ascriptions must be accommodated in a 
compositional truth theory for English if English has one. A full- 
scale defense of this claim would outrun a feasible number of pages, 
but I would like to give some sense of what supports the theory. 

One very important thing the hidden-indexical theory tells us is 
that the ‘that’-clause in a paradigmatic belief ascription such as [1] is 
a referential singular term. This at least has the appearance of being 
right: [1] does indeed seem to be telling us one of the things Ralph 
believes—to wit, that Fido is a dog. On the face of it, [1] says that 
Ralph stands in the belief relation to the referent of the singular 
term ‘that Fido is a dog’. This appearance that [1]’s ‘that’-clause, 
‘that Fido is a dog’, is a referential singular term is sustained by the 
evident validity of arguments such as 
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Ralph believes that Fido is a dog, and so does Thelma. 
So, there is something that they both believe (viz., that Fido is a dog). 


Ralph believes everything that Thelma says. 
Thelma says that Fido is a dog. 
So, Ralph believes that Fido is a dog. 


Ralph believes that Fido is a dog. 
That Fido is a dog is impossible. 
So, Ralph believes something that 1s impossible (viz., that Fido is a dog). 


For how else are we to account for the validity of these inferences 
other than on the assumption that ‘that’-clauses are referential sin- 
gular terms? I think that the rhetorical force of this question is 
correct: the best explanation of the way in which these arguments 
may be valid does presuppose that ‘that’-clauses are referential sin- 
gular terms.® 

It is obvious that we may give simple and straightforward explana- 
tions of the validity in question via hypotheses that entail that ‘that’- 
clauses are referential singular terms. The question is whether valid- 
ity can be accounted for without that assumption. Perhaps, as I 
myself argued in Remnants of Meaning,’ the arguments are valid 
only when their quantifications are read nonobjectually, say, as some- 
thing akin to substitutional quantification. This would entail that 
‘that’-clauses were not referential singular terms. Or perhaps, as 
Graeme Forbes? has argued, we can use some very fancy footwork to 
account for the validity even though [1]’s ‘that’-clause disappears on 
analysis, leaving us with an existential generalization along the lines 
of 


(im) Gm’) (m is a mode of presentation of Fido & mw’ is a mode of 
presentation of doghood & B(Ralph, the proposition that m has m’) 


Or perhaps we could salvage enough validity using Donald David- 
son’s ‘‘paratactic” theory of propositional-attitude ascriptions, 


€ I say ‘the way in which these arguments may be valid’ because of the alleged 
hidden-indexical nature of belief ascriptions: qua inference type, 

Ralph believes that Fido is a dog. 

So, Ralph believes that Fido is a dog. 

is no more valid than 

It’s raining. 

So, it’s raining. 

Tokens of these inference types are valid only when premuse and conclusion make 
the same implicit reference. 

7 Cambridge: MIT, 1987. 

2 “The i ity of Senn,” The Philosophical Review, xcix, 4 (1990): 
535-63 
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wherein the only singular term referring to what Ralph believes is 
the demonstrative ‘that’, as it occurs in 


Ralph believes that. Fido is a dog. 


when its reference is the utterance that follows it of ‘Fido is a dog’. 

Perhaps; but I doubt it. I have argued elsewhere® against David- 
son’s proposal, and if it were not for present constraints I would 
offer to explain why neither Forbes’s line nor my former self’s non- 
objectual-quantification line can be made to succeed.!° But there 
are the constraints; so I must ask you provisionally to assume that I 
have justified the assumption that ‘that’-clauses are referential singu- 
lar terms. Then we may move to the question this assumption 
invites. 

If ‘that Fido is a dog’ is a referential singular term, then we may 
ask what its reference is. To what, that is, does it refer? This, how- 
ever, is easy to answer: ‘that Fido is a dog’ refers to that Fido is a 
dog; what ‘that Fido is a dog’ refers to is that Fido is a dog. But what 
is this thing, that Fido ts a dog, which is the referent of the ‘that’- 
clause singular term? Well: (i) it is abstract, in that it has no spatial 
location. That Fido is a dog is not under the kitchen sink or any- 
where else. (ii) It is mind- and language-independent in that it exists 
in possible worlds in which there are neither thinkers nor speakers. 
That Fido is a dog is also language-independent in that, while it may 
be the content of a sentence of any language, it itself belongs to no 
language; it is not a linguistic entity. (iii) It has a truth condition, and 
it has its truth condition essentially, in contrast to the contingent 
way sentences have theirs. It is a contingent fact that ‘Fido is a dog’ 
is true iff Fido is a dog; if the conventions of English had been 
different, it might have had a totally different truth condition or 
none at all. But it is a necessary truth that that Fido is a dog is true 
iff Fido is a dog. It follows, then, that the referents of ‘that’-clauses 


°? Remnants of Meantng, ch. 5. 
1° Well, I guess I can afford to mention one especially devastating problem for 
the Forbes line Applied to 


Ralph said that Fido us a dog 
1t yields 

(on) Gar’) (m is a mode of presentation of Fido & w 1s a mode of presentation 

of doghood & S(Ralph, the proposition that įm has m) 
The trouble with this is that it 1s extremely unlikely that Ralph, ın hus utterance of 
‘Fido is a dog’, will have said any such mode-of-presentaton-containing proposi- 
tion If he did mean some such proposition, then there would be a ification of 
what he said that is other than ‘that Fido is a dog’ and that refers to a mode-of- 
presentation-containing proposiuon But it is clear that there need be no such 
alternative specification of what he said. 


BELIEF ASCRIPTION 507 


are propositions, in the philosophical sense of that term: abstract, 
mind- and language-independent objects that have essentially the 
truth conditions they have. 

If the hypothesis that ‘that’-clauses are referential singular terms 
is to cohere with the assumption that English has a compositional 
truth theory, then we must suppose that each ‘that’-clause is a se- 
mantically complex singular term whose reference is determined by 
its syntax and the references, or extensions, of its component words. 
What, we might then ask, are the references of ‘Fido’ and ‘dog’ in 
‘that Fido is a dog’, as it occurs in an utterance of [1]? 

For all intents and purposes, there are but two possible answers. 
One, apparently Frege’s, is that the terms in a ‘that’-clause refer to 
modes of presentation and the proposition referred to by the entire 
‘that’-clause is a structured proposition made up of those modes of 
presentation. With respect to a particular utterance of [1], the Fre- 
gean proposal would be that the token of ‘Fido’ refers to a particu- 
lar mode of presentation of Fido while the token of ‘dog’ refers to a 
particular mode of presentation of doghood, the utterance of [1] 
thus enjoying the representation 


, B(Ralph, (sty, may) 


(me, mg) being the mode-of-presentation-containing proposition re- 
ferred to by the occurrence of ‘that Fido is a dog’ in the utterance. 
This will be true, of course, only if ms and m, are ways Ralph has of 
thinking of Fido and doghood, respectively. 

The merit of this proposal is that it allows us to see how the 
ancient astronomer was able to believe that Hesperus was Hesperus 
long before she believed that Hesperus was Phosphorus. Its demerit 
is that it is false. For consider 


[3] Everyone who has ever known her has believed that Madonna was 
musical. 


According to the Fregean proposal, there is a particular mode of 
presentation m of Madonna and a particular mode of presentation 
m of the property of being musical such that the foregoing utter- 
ance of [3] is true only if everyone who has ever known Madonna 
has believed the proposition (m, m’). Yet this is surely too strong a 
requirement on the truth of [3]. It requires that everyone who has 
ever known Madonna shared a single way of thinking of her and a 
single way of thinking of the property of being musical, and this is 
most unlikely given that there may have been people who knew her 
as a child and then died and that someone like Helen Keller may 
have been among them. If the Fregean proposal were correct, we 
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could prove that [3] was false just on the basis of the virtual cer- 
tainty that, whatever modes of presentation turn out to be, there will 
not be one single mode of presentation of Madonna shared by all 
who ever knew her. But it seems that we cannot disprove [3] in this 
way, and this because we would count it as true if each person who 
has ever known Madonna thought of her as being musical under 


some mode of presentation or other." 

Our question is: What are the references of ‘Fido’ and ‘dog’ in 
[1]’s ‘that’-clause on the assumption that it refers to a proposition 
that is true just in case Fido is a dog? The second possible answer is 
that they are Fido and doghood, respectively, (Fido, doghood) be- 
ing the referent of ‘that Fido is a dog’.’? Certainly this offers a 
proposition with the right truth conditions. The prima facie prob- 
lem with this proposal is that it is apt to suggest—and has been 
taken by many to suggest—that the logical form of [1] may be repre- 
sented as 


[4] B(Ralph, (Fido, doghood)) 


And you may be very quickly reminded of the well-known problem 
with this representation if I momentarily change the example. 
Surely, Lois Lane does not believe that Clark Kent flies, though she 
does believe that Superman flies. But this is impossible according to 


11 Here is a longer way of showing that ‘that’-clauses do not refer to mode-of- 
presentation-containmng propositions. Intuitively, ‘believes that Fido is a may 
be unrvocally true of people who think of Fido and doghood in radically different 
ways (you and Helen Keller may be among them) and who do not share any single 
mode of presentation for either Fido or doghood This shows that ‘that Fido 
is a dog’ makes no context-independent reference to a mode-of-presentation- 


con proposition At the same time, a speaker may truly say so-and-so 
ea panes ido is a dog even though she 1s not m a position to refer to any 
particular mode of presentation so-and-so has for either Fido or doghood This 
shows that the ‘that’-clause makes no context-dependent reference to a mode- 
of-presentation-containing proposition. And if it makes neither a context- 
mdependent nor a context-dependent reference to such a proposition, then it 
makes no reference to one. 

1? Friends of possible-worlds semanucs or of counterpart theory will think my 
rush to singular propositions a bit cavalier. Why not say that the reference of 
‘Fido’ is a function from possible worlds to referents of ‘Fido’ in those worlds— 
Fido himself, perhaps, in each world, or variable counterparts of our Fido? First, 
I think importation of possible worlds is unmotivated, though I cannot argue that 
here. Second, I think any prima facie plausible theory that used possible worlds 
and propositions that were constructions based on would, when the dust 
settled, simply yield alternative styles of ron ras | Third, nothing that I really 
care about changes if, say, you opt for a version of the hidden-indexical theory 
wherein ‘Fido’ in a ‘that'-clause refers to that constant function that maps each 
possible world onto our Fido (but the counterpart theory might be a version of 
the rejected proposal that the referent is a mode of presentation of Fido). 
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the present proposal, since, as Clark Kent = Superman, the proposi- 
tion that Superman flies = the proposition that Clark Kent flies. 
Likewise, reverting now to the ongoing example, Ralph may believe 
that Fido is a dog without believing that Fido is a shmog, and he may 
believe that Fido is male without believing that Fi Fi is male. Yet all 
this is impossible according to the theory in question, since it implies 
that the proposition that Fido is a dog = the proposition that Fido is 
a shmog and that the proposition that Fido is male = the proposi- 
tion that Fi Fi is male. The problem is even manifested in belief 
ascriptions involving indexical reference. For example, pointing to 
one mug shot, Thelma might say, ‘I believe that he is the culprit’, 
while pointing to another, she might say, ‘I neither believe nor dis- 
believe that this guy is the culprit; I simply can’t tell’. Intuitively, 
both of her utterances are true, even though, as it may happen, the 
same man is referred to in each utterance; but this, again, is incom- 
patible with the proposal in question. 

Nathan Salmon’® has tried to defend the extreme ‘Fido’-Fido 
theory of belief that would represent [1] as [4] by trying to explain 
away the patina of counterintuitiveness adhering to the theory’s 
consequences. But I think he has not succeeded, and in “The ‘Fido’- 
Fido Theory of Belief’ I try to show why.'* We really do need a 
theory that allows us to see how an utterance of 


Lois believes that Superman flies but doesn’t believe that Clark Kent 

flies. 
can be literally true. 

Here, then, is our situation. We are motivated to see [1]’s ‘that’- 
clause as a referential singular term whose reference is the proposi- 
tion that Fido is a dog. Since we are assuming that English has a 
compositional truth theory, we must hold the reference of this 
‘that’-clause to be determined by its syntax and the semantic values 
the words in the ‘that’-clause have in it. Those semantic values— 
those references—cannot be modes of presentation, and the only 
viable option is that they are Fido, for ‘Fido’, and doghood, for 
‘dog’. At the same time, we cannot accept the representation of [1] 
as [4], for we also want to allow that Ralph does not believe that 
Fido is a shmog. 

Enter now on its white charger the hidden-indexical theory, which 
satisfies all desiderata. This theory, you will recall, represents the 


13 Frege's Puzzle (Cambridge: MIT, 1986). 
M See also Salmon’s reply, “logical Belief,” Philosophical Perspectrves, m 
(1989) 243-85. 
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logical form of an utterance of [1] as [2], where &* is an implicitly 
referred to and contextually determined type of mode of presenta- 
tion. Thus, ‘that Fido is a dog’ in [1] is a referential singular term 
whose referent is the singular proposition (Fido, doghood), and 
thus the referents of ‘Fido’ and ‘dog’ in that ‘that’-clause are Fido 
and doghood, respectively. At the same time, we can see how an 
utterance of 


[5] Ralph does not believe that Fido is a shmog. 


can also be true, even though, as shmoghood = doghood, ‘shmog’ in 
[5] has the same reference as ‘dog’ in [1], so that the ‘that’-clause in 
each sentence refers to one and the same singular proposition, to 
wit, (Fido, doghood (i.e., shmoghood)). That the utterances of [1] 
and [5] can both be true is due, of course, to the hidden-indexical 
feature: in the normal case, the type of mode of presentation re- 
ferred to in an utterance of [1] will be different from that referred 
to in an utterance of [5]. Likewise in the other examples discussed, 
and likewise as regards the London/Londres and Paderewski exam- 
ples that puzzled Saul Kripke." These are no puzzles for the 
hidden-indexical theory. 
Ill. THREE PROBLEMS FOR THE HIDDEN-INDEXICAL THEORY 

So much for what recommends the hidden-indexical theory. I shall 
now raise three problems for the theory. 

The candidate problem. This problem arises for any theory that 
appeals to modes of presentation. It is a problem I have discussed at 
length elsewhere,’* and I will say only enough about it now to indi- 
cate its nature. In a nutshell, the candidate problem is that there is 
some question whether there is a plausible candidate for what 
modes of presentation are. 

The notion of a mode of presentation is, we saw, a functionally 
defined technical notion: a mode of presentation is whatever plays 
the mode-of-presentation role defined by Frege’s constraint. So we 
may ask what things play that role. This is like asking what genes are, 
the notion of a gene being functionally defined as whatever is re- 
sponsible for the transmission of hereditary characteristics. There 
the answer turned out to be segments of DNA molecules. What, 
though, is the answer to the mode-of-presentation question? There 
is no dearth of proposed answers: 


15 “A Puzzle about Belief,” in A. Margalit, ed , Meaning and Use (Dordrecht: 
Reidel, 1979). 

16 “The Basis of Reference,” Erkenntnis, xm, 1 (1978). 171-206; Remnants of 
Meaning, esp. ch 3; and “The Mode-of-Presentation Problem.” 
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Modes of presentation are individual concepts, uniqueness proper- 
ties of the form the property of betng the unique tnstantiator of such-and- 
such property. 

Modes of presentation are simply general properties that make no 
pretense of applying uniquely to the things of which they are modes of 


presentation. 

Modes of presentation are percept tokens, actual instances of sensory 
experience. 

Modes of presentation are ‘‘stereotypes,”” perhaps what cognitive psy- 
chologists call prototypes. 


Modes of presentation are ‘‘characters,”’ functions from contexts of 
the utterance of an expression to the expression’s contents ın those 
contexts. 

Modes of presentation are public language expressions. 

Modes of presentation are Mentalese expressions, formulae in the 
neural language of thought. 

Modes of presentation are functional roles, perhaps conceptual roles 
of Mentalese expressions. 

Modes of presentation are causal chasns linking Mentalese names 
and predicates to the objects and properties for which they stand. 

And finally there is the no-theory theory of modes of presentation, 
which rejects what I have elsewhere called the ininnsic-description con- 
straini.” 


This constraint holds that if a thing is a mode of presentation—if, 
that is, it plays the mode-of-presentation role—then it must be in- 
trinsically identifiable in a way that does not describe it as a mode of 
presentation or as a possible mode of presentation. If a thing is a 
mode of presentation, then it must be intrinsically identifiable as 
some other kind of thing. Rejecting this constraint, the no-theory 
theorist of modes of presentation holds that modes of presentation 
are modes of presentation and there is an end to it. You simply 
cannot say what they are in any other terms. 

But I think the intrinsic-description constraint is well-motivated, 
and I think there are compelling objections against each of the fore- 
going candidates but one. The exception is the view that modes of 
presentation are functional roles, perhaps conceptual roles of Men- 
talese expressions. This view of what modes of presentation are has 
intrinsic problems and it does not cohere with every theory of the 
logical form of belief ascription; but its problems do not refute it and 
it does cohere with the hidden-indexical theory.'* In any case, the 


17 “The Mode-of-Presentation Problem.” 

™ This corrects a flawed t in ‘The Mode-of-Presentation Problem” 
which was intended to show modes of presentation cannot be conceptual 
roles of Mentalese expressions. The argument overlooked what must be the in- 


512 THE JOURNAL OF PHILOSOPHY 


hidden-indexical theory entails that believing is a relation among a 
believer, a proposition, and a mode of presentation of that proposi- 
tion, which mode of presentation is in turn determined by modes of 
presentation of the proposition’s components. It is hard to see how 
we can reasonably accept this theory without some well-motivated 
idea of what these modes of presentation are supposed to be. 

The meaning-intention problem. This problem is that one 
may reasonably doubt that belief ascribers mean what the hidden- 
indexical theory requires them to mean when they ascribe beliefs. 
Let me explain. 

Floyd’s sister calls him in Chicago and asks about the weather. 
Floyd replies, “It’s raining.” Here Floyd refers to Chicago, and this 
by virtue of the fact that in uttering ‘It’s raining’, Floyd means that it 
is raining in Chicago. In other words, Floyd counts as having re- 
ferred to Chicago because the proposition he meant is about Chi- 
cago. Notice that there is no difficulty whatever in ascribing to Floyd 
the propositional speech act in question: he clearly intended his 
sister to believe that it was raining in Chicago, and he is quite pre- 
pared to tell you that this is what he meant, what he implicitly said, 
and what he intended her to be informed of. Because these things 
are so clear, Floyd’s utterance is a paradigm of implicit reference. 

Now for a different case. During a casual conversation about air- 


fare bargains, Flora says, 


[6] Harold believes that TWA is offering a New York-Paris return 
fare for $318. 


According to the hidden-indexical theory, there is a property ® of 
modes of presentation of the proposition that TWA is offering a 
New York-Paris return fare for $318 such that Flora referred to ® 
in her utterance. If this is true, then, presumably, it is because Flora, 
in producing her utterance, meant some proposition about Ẹ in just 





tended generalization of the hidden-mdexical theory for belief ons with 
embedded ‘that’-clauses—namely, that a belief ascription with n ‘that’-clauses 
embedded m the primary ‘that’-clause must be making n + 1 implicit references 
to types of modes of presentation Thus, the logical form of an utterance of 


Floyd believes that Lois believes that Superman flies. 
is 

Gm) ($m & B(Floyd, [Gon’) ($m & B(Lois, [Superman flies], »))], »)) 
where @ and & are implicitly referred to and contextually determined types of 
modes of presentation (and where ‘[. . .]’ is a smgular-term forming operator with 
the force of ‘the Aoi es that .. ."). If this inevitable complexity weakens the 
plausbility of the hi indexical theory, then so be it. 
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the way that Floyd, in his utterance, meant some proposition about 
Chicago. But, we shall see, it is doubtful that Flora meant any 
such thing. 

We had no trouble saying what Floyd meant in uttering ‘It’s rain- 
ing’: he meant that it was raining in Chicago. But what that implies a 
reference to a type of mode of presentation does Flora mean in 
uttering [6]? It must surely be 


[7] that there is something that both has & and is such that Harold 
believes the proposition that TWA is offering a New York-Paris 
return fare for $318 under it 


for some particular property ® of modes of presentation of the 
proposition that TWA is offering a New York-Paris return fare for 
$318. But—and this is where the trouble begins—neither Flora nor 
her audience is aware of her meaning any such thing. Floyd is 
consciously aware of both the form of the proposition he meant (he 
is aware that he meant that it is raining in Chicago) and the implic- 
itly referred to thing the proposition is about (he is aware that he 
meant that it is raining in Chicago). But it is doubtful that the non- 
philosopher Flora has conscious access to the form of [7], and it is 
especially doubtful that she has conscious awareness of referring to 
any mode-of-presentation property. That is to say, if she did mean a 
proposition of form [7], then she has no conscious awareness of 
what property ® that proposition is about, and no conscious aware- 
ness, therefore, of what she meant in uttering [6]. 

Neither we nor Flora can say what mode-of-presentation property 
was referred to in her utterance of [6]. Just try to say it. The 
referred-to property, if it exists, would be given by a completion of 
the form 


The property of being an m such that to believe the proposition that 
TWA is offering a New York-Pans return fare for $318 under m re- 
quires thinking of TWA, the offering relation, New York, Paris, the US 
dollar currency, the number 318 [and so on for the other components 
of the proposition] in ... ways, respectively. 


But the nonphilosopher Flora has no access even to this form of 
specification, and who among us can offer to replace the three dots? 
Now, of course, it is in principle possible to have knowledge of such 
a property even though one cannot specify it in the forgoing way. 
One might, for example, have knowledge of it under some quite 
extrinsic description. But I submit that it will be obvious on reflec- 
tion that Flora has no such alternative way of explicitly picking out a 
property of modes of presentation to which she is implicitly refer- 
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ring in her utterance of [6].1° There is no sentence o such that Flora 
can say ‘I meant that o’, where ‘that o’ explicitly refers to a proposi- 
tion of [7]’s form. 

Thus, if the hidden-indexical theory is correct, then Flora has no 
conscious awareness of what she means, or of what she is saying, in 
uttering [6], and this is a prima facie reason to deny that she means 
what the theory is committed to saying she means. 

There is more than one way a hidden-indexical theorist might 
respond to this prima facie problem, but let me go directly to what I 
think is the theorist’s best bet. This response is to concede that Flora 
is not aware of meaning a proposition of form [7] under any mode 
of presentation, but to insist that she nevertheless meant such a 
proposition. The idea is to appeal to so-called “tacit belief’ and 
“tacit intention.” This idea has its origin, I believe, in Noam 
Chomsky’s controversial claim that speakers know a grammar of 
their language. To this it was protested that in no ordinary sense of 
‘know’ do speakers know any such thing. When the dust settled, 
many theorists supposed that the best construal of Chomsky’s claim 
was that the correct theory of language processing implies that a 
speaker has an internal representation of the grammar of her lan- 
guage, where this internal representation is stored not as a belief, 
but as what Stephen Stich was later to call a subdoxastic state.™ 
Subdoxastic states are representational states that are belief-like in 
some respects but differ from paradigmatic beliefs in their unavail- 
ability to consciousness and the way they are inferentially isolated 
from beliefs (for example, if you believe that if p, then q, and believe 
p, you are likely to infer q; but you will not make this inference if the 
state representing p is subdoxastic). Now, the notion of a tacit prop- 
ositional attitude can admit of a spectrum of cases, from the 
Chomsky-required subdoxastic states at one end, to states that 
may reasonably be claimed to be subsumable under our current 
propositional-attitude concepts, at the other. What is evidently uni- 
fying about the notion of a tacit propositional attitude is that a tacit 
propositional attitude is unavailable to consciousness and that one 
cannot be made irrational by conflicts between those of one’s 


19 Besides, it is quite against the spint of the hidden-indexical theory that the 
type of mode of presentation referred to should be one that the speaker knows 
only under some nonintrinsic mode of presentation. The idea of the theory 18 
that, though one might not know the particular mode of presentation under 
which so-and-so believes that such-and-such, still one has access to some interest- 
ing description that the mode of presentation satisfies. 

‘Beliefs and Subdoxastic States,” Philosophy of Science, xLv (1978): 
499-518. 
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thoughts which are available to consciousness and those which are 
merely tacit. At any rate, the present proposal is that it is thanks to 
her tacit beliefs and intentions in uttering [6] that Flora means a 
proposition of [7]’s form even though she is not aware of meaning 
any such thing. 

One trouble with the tacit-intention proposal is that it induces a 
rather radical error theory: not only do ordinary belief ascribers have 
no conscious knowledge of what they are asserting, they also turn 
out not to have the conscious thoughts they think they have. Flora 
clearly thinks she has conscious knowledge of what she is asserting in 
uttering [6]. She is quite prepared to say, “Look, what I am saying, 
and all that I am saying, is that Harold believes that TWA is offering 
a New York-Paris return fare for $318.” In other words, she thinks 
she is consciously aware of what she is asserting in uttering [6], but 
the tacit-intention line implies that here she is in error: the only 
proposition she asserts in uttering [6]—viz., some proposition of 
form [7]—1is not anything of which she is conscious. What makes 
this error aspect of the tacit-intention proposal problematic is not 
merely that it riddles the propositional-attitude ascriptions of ordi- 
nary speakers with error; it also forces us to qualify our views about 
first-person authority in an important way. Flora does not have the 
privileged access to what she consciously means, intends, and be- 
lieves in uttering [6] which one might reasonably have supposed to 
be part of a normal person’s functional architecture. 

A more serious trouble with the tacit-intention proposal actually 
constitutes a direct objection to the claim that speakers have the 
meaning intentions the hidden-indexical theory requires them 
to have, whether or not those intentions are claimed to be tacit. 
But the objection is best presented here, in response to the tacit- 
intention proposal, since it is obvious that speakers do not 
consciously mean what the hidden-indexical theory entails that they 
mean. The objection takes hold even if it is conceded that ordinary 
people do have some sort of indirect awareness of modes of presen- 
tation, which is all to the good, as it is not out of the question that 
one could motivate the idea that Flora tacitly believes that Harold 
believes the proposition about TWA under a certain type of mode of 
presentation. The point to be pressed, however, is that considerably 
more is required of tacit meaning. I shall explain. 

Meaning entails audience-directed intentions, and one cannot 
mean something without intending to be understood. Part of mean- 
ing that such and such is intending one’s audience to recognize that 
that is what one meant, and—a corollary—part of referring to a 
thing is intending one’s audience to recognize that reference. Con- 
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sequently, the present tacit-intention proposal requires there to be 
some type of mode of presentation ® such that Flora tacitly means 
that Harold believes that TWA . . . under a mode of presentation of 
type ®, and tacitly intends her audience to recognize (no doubt 
tacitly) that she means that Harold believes that TWA ... under a 
mode of presentation of type ®. But as James Higginbotham 
pointed out to me, this puts the implausibility of Flora’s meaning a 
proposition of form [7] on a par with that of a certain way of trying 
to extend the description theory to “incomplete” definite descrip- 
tions. The idea there is that, when a speaker utters a sentence like 
“The dog has fleas’, then there must be some property ® such that 
she means the proposition that the thing that is uniquely a dog and 
® has fleas. What makes this so implausible is that there will typically 
be a number of potentially completing descriptions that are equally 
salient in the context (e.g., the dog that I own, the dog we are both 
looking at, the spotted dog in the green chatr, and so on). Conse- 
quently, it is highly implausible that there will be one such descrip- 
tion such that the speaker intends it to be understood between her 
and her audience that she means a proposition containing that very 
description.”! 

Now, just the same problem arises with the meaning claim re- 
quired by the hhidden-indexical theory. For assume that propositions 
are believed under modes of presentation. If a proposition is be- 
lieved under one mode of presentation, then it will typically be be- 
lieved under many modes of presentation. Further, each of those 
modes of presentation will instantiate infinitely many types of modes 
of presentation, many of which will be equally salient in the com- 
municative context. This makes it extremely implausible that of all 
the equally salient types of ways that Harold has of believing the 
proposition about TWA, Flora should mean—and intend to be 
taken to mean—a proposition about one definite one of them. 

A little thought experiment should make the objection vivid. 
When considering the hypothesis that a certain expression has a 
certain semantic property, it is often helpful to introduce an expres- 
sion that has that semantic property by stipulation. Presently in de- 
bate is the hidden-indexical theory’s claim that one uttering [6] 
would mean a proposition of the form [7] displays. So let us intro- 
duce ‘shmelieves’ as a term that satisfies the hidden-indexical theory 
by stipulation: to utter a sentence of the form 


2! See my ‘‘Indexicals and the Theory of Reference,” Synthese, XLIX, 1 
(1981):48—100; esp. pp. 77-8. 
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A shmelieves that S. 


with its literal meaning requires a speaker to mean 
that there is something that both has @ and is such that A believes the 
proposition that S under it 


for some particular contextually determinate property # of modes 
of presentation of the proposition that S. The hidden-indexical 
theory may or may not be true of ‘A believes that S’, but it is true by 
stipulation of ‘A shmelieves that S’. 

So far so good, but let me now try to use this newly minted coin to 
make an assertion. 

Placido Domingo shmelieves that Luciano Pavarotti will be arriving at 

Only airport today at around 4 PM Paris time. 

Very well, what assertion did I make? What is the type of mode of 
presentation ® such that I meant that Placido Domingo believes the 
proposition that Luciano Pavarotti will be arriving at Orly airport 
today at around 4 PM Paris time under a mode of presentation that 
has &? Remember, ® is made up of types of modes of presentation 
for each of the constituents of the proposition to which my ‘that’- 
clause refers: Luciano Pavarotti, the arrival relation, Orly airport, 
the number 4, Paris, etc. The question, of course, cannot be an- 
swered. For although ‘shmelieves’ was stipulatively given a perfectly 
good hidden-indexical semantics, I was not, in uttering the displayed 
sentence, in a position assertively to use it, and this because I was 
not in a position to refer to a contextually determinate type of mode 
of presentation. None is sufficiently salient to enable you, my au- 
dience, to identify it as the one J meant, and this notwithstanding 
the fact that we understand all the concepts involved and everything 
has been painstakingly raised for us to the level of conscious 
awareness. 

The application to the hidden-indexical theory should be clear. 
Although I cannot utter the forgoing ‘shmelieves’ sentence and 
mean what its literal meaning requires me to mean, there are no 
such impediments to my uttering ‘Placido Domingo believes that 
Luciano Pavarotti will be arriving at Orly airport today at around 4 
PM Paris time’. I can perfectly well utter this and hope to have 
my literal assertion be perfectly well-understood by you. Since 
‘shmelieves’ is by stipulation what ‘believes’ would be if the hidden- 
indexical theory were true, this appears to show that the theory is 
not true. The best account of why neither Flora nor her audience 
(who understood her perfectly well) is aware of her meaning a prop- 
osition of form [7], as the hidden-indexical theory requires her to 
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mean, is not that the requisite meaning intentions have gone tacit; it 
is that she does not mean any such thing. 

The logical-form problem. According to the hidden-indexical 
theory, ‘believes’ is a three-place relational predicate. But this might 
strike one as fishy for the following reason. If ‘believes’ really were 
three-place, then it ought to have occurrences where its three-place 
form is explicit, as the three-placedness of ‘gives’ is made explicit 
when we move from ‘She gave the house’ to ‘She gave the house to 
her husband’. Yet in what analogous way can we make explicit the 
three-placedness of ‘Ralph believes that Fido is a dog’? Perhaps in 
the way of 


[8] Ralph believes that Fido is a dog in way w/under mode of presenta- 
tion m. 
There are, however, two problems with this. 

First, this is no ordinary-language specification but technical jar- 
gon. If one is told that Mary gave the house, one can ask, “To whom 
did she give the house?” But what is the corresponding ordinary- 
language question for ‘Ralph believes that Fido is a dog’? Ask a 
nonphilosopher, “In what way, or under what mode of presenta- 
tion, does Ralph believe this?” and your best answer will be a puz- 
zled look. 

Second, [8] does not look like a specification of a three-place 
relation. It looks more like a two-place relation with an adverbial 
qualifier. In other words, it is to be assimilated to 


Ralph kissed her in the most exciting way. 


where ‘the most exciting way’ is merely part of an adverb, rather 
than to 


[9] Mary gave the house to her husband. 


where ‘her husband’ is an argument of the verb ‘to give’. Evidence 
for this assimilation comes from well-known constraints in syntactic 
theory which distinguish between arguments and adverbs in their 
ability to extract from ‘whether’-clauses.” Thus, the argument sta- 
tus of ‘her husband’ in [9] is revealed in the fact that we can answer 
‘Her husband’ in response to the question 


To whom did you wonder whether Mary gave the house? 


2 See Chomsky, Barners (Cambridge: MIT, 1986). I owe this reference and the 
point about ‘whether’-clauses to Robert Fiengo. 
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And the nonargument, adverbial status of ‘way w/mode of presenta- 
tion m’ in [8] is revealed by the fact that it cannot be given in answer 
to the question 


In what way/under what mode of presentation did you wonder 
whether Ralph believes that Fido is a dog? 


IV. THE IMPORTANCE OF THE ISSUE 

I have proposed that, if natural languages have compositional truth 
theories, then the hidden-indexical theory is the correct account of 
the accommodation of belief ascriptions in them. I have also offered 
some reason for thinking the hidden-indexical theory is false. If it is 
both conditionally correct and false, then it follows that natural 
languages do not have compositional truth theories. This is one 
reason the issue of the semantics of belief ascriptions is important. 
There is, however, another reason for the importance of the ques- 
tion of the accommodation of belief ascriptions in a compositional 
truth theory; it has to do with the bearing of this question on the 
vexing question about the form that a compositional meaning theory 
for a language must take. I shall explain. 

It is commonly assumed that each natural language L has a compo- 
sitional meaning theory, this being a finitely axiomatizable theory of 
L whose theorems ascribe to each sentence of L its meaning in L. 
Those who make the assumption reason that without it there will be 
no explaining either the platitude that the meaning of a sentence is 
determined by its syntax and the meanings of its component words 
or the fact that we have the ability to understand indefinitely many 
novel sentences. But while it is generally agreed that every natural 
language enjoys a compositional meaning theory, no three philoso- 
phers can be made to agree on the form such a theory must take. 
For a start, it is not even clear what form the theorems of a meaning 
theory would have to take. You might at first blush suppose that a 
compositional meaning theory for L would generate theorems of 
the form 


‘S means in L that such and such. 


but this could hardly be the form of a theorem for sentences like 
‘He was there’ or ‘Stop singing, you idiot!’ and it is anything but 
clear that the word ‘means’ should ever be used in saying what a 
sentence means. After all, to know the meaning of a sentence is to 
be able to know what a speaker is saying in uttering the sentence; so 
it is reasonable to suppose that a compositional meaning theory 
would have to ascribe to each sentence a feature, knowledge of 
which would enable us to know what would be said in the utterance 
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of any sentence having that feature. But there is no agreement as to 
what such a feature would be like, or that the word ‘meaning’ would 
need to be used to ascribe it. 

Those who assume that each natural language has a compositional 
meaning theory typically assume that each natural language also has 
a compositional truth theory, in the sense already glossed. After all, 
it is part of the meaning of ‘Snow is white’ that an utterance of it is 
true iff snow is white; so if it is reasonable to think a compositional 
theory can assign meanings to sentences, then it is reasonable to 
think one can assign truth conditions to them, too. What we are now 
in a position to see is that, if we know how to accommodate belief 
ascriptions in a compositional truth theory, then we will be able to 
answer the question about the form that a compositional meaning 
theory must take. This ought at first to seem a little surprising. 
Merely knowing how to accommodate ‘Fido is a dog’ in a truth 
theory would not per se tell us the first thing about how to accom- 
modate that sentence in a meaning theory. So how can knowing how 
to accommodate belief ascriptions in a truth theory tell us how to 
accommodate any sentence in a meaning theory? 

I shall forgo a general answer, but we can see straightway how the 
hidden-indexical theory of belief ascription constrains the nature of 
a meaning theory. For what applies to belief ascriptions applies, 
mutatis mutandis, to all propositional-attitude ascriptions involving 
‘that’-clauses. What holds for ‘Ralph believes that Fido is a dog’ also 
holds for ‘Ralph says that Fido is a dog’. The theory tells us that the 
‘that’-clause in a saying ascription refers to a Kaplan proposition. 
This means that to know what a speaker said in the utterance of a 
sentence is to know what Kaplan proposition she said. Since to un- 
derstand a language is to have the ability to understand utterances ~ 
in it, and since to understand an utterance is to know what the 
speaker said in it, we can see that, if the hidden-indexical theory is 
correct, then the main task of a compositional meaning theory for a 
language will be the construction of a finitely specifiable correlation 
that relates each sentence of the language to what Kaplan has called 
a character—a function that, in effect, maps each utterance of the 
sentence onto the Kaplan proposition that is the propositional ob- 
ject of the primary speech act performed in the utterance.” In 
other words, if the hidden-indexical theory of the accommodation 
of propositional-attitude ascriptions in a truth theory is correct, 
then so is the “direct reference” approach to a compositional mean- 


™ Kaplan, ‘“‘Dthat,” in P Cole, ed, Syntax and Semantics 9: Pragmatics (New 
York: Academic, 1978) 
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ing theory, and in this way we can see how the hidden-indexical 
theory constrains the nature of a meaning theory. 

It may be worth noting that this point I have been laboring re- 
verses the usual procedure of those who argue for the direct refer- 
ence treatment of ‘that’-clauses implied by the hidden-indexical 
theory. That procedure begins by assuming a direct reference ap- 
proach to meaning and then draws the consequence as regards the 
propositional content of propositional-attitude ascriptions, which 
consequence entails that ‘that’-clauses are referential singular terms 
whose referents are Kaplan propositions. My procedure has been 
different, and I think less contentious. I started with no assumptions 
about “content” or meaning. My question was wholly about the 
references that must be assigned to words in a belief ascription on the 
assumption that languages had compositional truth theories, the- 
ories whose theorems are of the extensional paradigm 


‘La neige est blanche’ is true in French iff snow is white. 


It was then argued that, relative to the assumption, ‘that’-clauses 
had to be construed as referential singular terms whose referents are 
Kaplan propositions. Then, in the final step, this premise about 
reference was used to derive the general point about meaning. 

V CONCLUSION 
If the hidden-indexical theory is correct, then we not only know how 
to accommodate belief ascriptions in a compositional truth theory 
for English, we also know the form that a compositional meaning 
theory for English must take. But I did not argue that the hidden- 
indexical theory is correct. I merely argued that it is the correct 
theory relative to the assumption that natural languages have com- 
positional truth theories, and I proceeded to raise problems for the 
theory. There are but three possible ways out of this impasse: either 
the problems can be answered, or the hidden-indexical theory is not 
the best accommodation of belief sentences in a compositional truth 
theory, or natural languages such as English do not have composi- 
tional truth theories. Elsewhere I have argued that the third option 
is one that deserves to be taken seriously. Others may disagree; but 
I hope it will be agreed that the hidden-indexical theory and its 
obstacles present a challenge that cannot be ignored. 

STEPHEN SCHIFFER 

CUNY Graduate Center 


™ Remnants of Meaning; “Does Mentalese Have a Compositional Semantics?” 
in B. Loewer and G. Rey, cds., Meaning in Mind: Essays on the Work of Jerry 
Fodor (Cambridge: Blackwell, 1991); and ““Actual- Language Relations,” forthcom- 
ing in Philosophical Perspectrves, vi (1993). 


522 THE JOURNAL OF PHILOSOPHY 


IN DEFENSE OF THE RAMSEY TEST* 


or more than two decades, much of the philosophical debate 

on conditionals has focused on a proposal by Frank Ramsey’ 

for the evaluation of conditional sentences. According to 
Ramsey, ‘‘[i]f two people are arguing ‘If p will q? and are both in 
doubt as to p, they are adding p hypothetically to their stock of 
knowledge and are arguing on that basis about q” (ibid., p. 247). 
The proposal was developed into a general principle by Robert Stal- 
naker.? In modern studies of belief dynamics, the Ramsey test is 
taken to “presume .. . some method of revising states of belief.” 
Thus, ‘If p then q’ is taken to be believed if and only if q would be 
believed in the belief state that would result from revising the pres- 
ent belief state by p. 

Formal studies of the epistemological application of the Ramsey 
test have gained impetus from the models of belief dynamics that 
have been developed in particular by Isaac Levi and by C. E. Al- 
chourrén, Peter Gärdenfors, and David Makinson.‘ Gärdenfors, 
however, has caused considerable commotion in the belief dynamics 
community by proving that the Ramsey test is incompatible with 
what seem to be weak and reasonable demands on a system of belief 
revision." 

Section I summarizes Gärdenfors’s impossibility theorem and the 
major escape routes from it. In section II, both informal arguments 
and a formal result are presented in support of one of these escape 
routes which leaves the Ramsey test untouched. In section III, a 
formal account of epistemic conditionals is proposed that complies 
with the Ramsey test. This account is extended in section IV to 
iterated conditionals of unlimited complexity. 


* I would like to thank Sten Lindström, Wlodek Rabinowicz, and Krister Seger- 
berg for valuable comments. 

1 Foundations of Mathematics and Other Logical Essays (New York: Routledge, 
1931). 

2“A Theory of Conditionals,” in Nicholas Rescher, ed., Studies m Lo 
Theory, Amencan Philosophical Quarterly Monograph Series, II (Cambridge: 
Blackwell, 1968), pp. 98-112, esp. 101-5. 

* Peter G ora, Knowledge im Flux: Modeling the Dynamics of Epistemsc 
States (Cambridge: MIT, 1988), p. 148. 


4 Levi, The E of Knowledge (Cambridge: MIT, 1980); Alchourrén, 
Gärdenfors, and i “On the Logic of Theory Change: Partal Meet Func- 
tions for Contraction and Revision,” Journal of Logic, L (1985): 510-30; 
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5 “Belief Revisions and the Ramsey Test for Conditionals,” Philosophical Re- 
view, XCV (1986): 81-93 
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I THE IMPOSSIBILITY THEOREM 
Gärdenfors showed the Ramsey test for conditionals to be incompat- 
ible with a set of conditions on an operator * of belief revision 
(ibid.). His proof makes use of the following assumptions: 


(L) The language is governed by a logic including classical truth- 
functional logic, and the consequence relation satisfies the de- 
duction theorem. 

(S) Belief states are identified with belief sets, 1.¢., sets of sentences 
that are closed under logical consequence. 

(0) Belief sets include sentences that contain the conditional con- 
nective > as elements. 

(1) p E K*p (success). 

(2) If K and p are consistent, then so is K*p. 

(R) p >q EK if and only if q E K*p (Ramsey test). 

(P) If ~p Œ K and q E K, then q E K*p (preservation). 

(Œ) Let K be a belief set. Then K + p = {q|p > q E K}, the expan- 
sion of K by p, is a belief set (closure under expansion). 

(NT) There are three sentences p, pa, and ps and some belief set K 
such that p, & pz, pı & ps, and py & ps are all inconsistent and 
that —p, € K, —p, £ K, and —ps € K (nontriviality). 


From (R) the following follows directly: 


(M) For all belief sets K and K’ and all sentences p: if K ¢ K then 
K*p S K™p. 


Gärdenfors was able to prove that (L), (S), (0), (1), (2), M), (P), Œ), 
and (NT) are incompatible. From this follows directly that (L), (S), 
(0), (1), (2), (R), (P), (Œ), and (NT) are incompatible. (Some of these 
conditions were not mentioned explicitly by Gardenfors in his state- 
ment of the theorem, but were only included as background condi- 
tions. The importance of (E) was first highlighted by Hans Rott.®) 

In Gardenfors’s opinion, we have to give up either (R) or (P), 
since the others are “rather innocuous background assumptions.” 
His own choice was to give up (R). Thus, according to its author, the 
impossibility theorem urges us to give up the Ramsey test for episte- 
mic conditionals. Since then, however, several other escape routes 
have been investigated. 

Various ways to weaken (L) have been studied, but the theorem 
has resisted all attacks on this flank. Thus, Krister Segerberg® gener- 
alized the impossibility theorem to a large family of nonclassical 

6 “Conditionals and Theory Change: Revisions, Expansions, and Additions,” 
Synthese, LXXXI (1989): 91-113. 

7 Gärdenfors, “Belief Revisions,” p. 86. 
re Note On An Impossibility Theorem ‘of Girdenfors” Nous, xxm (1989). 
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logics whose consequence relations satisfy monotony (i.e., if X is a 
subset of Y then Cn(X), the set of logical consequences of X, is a 
subset of Cn{Y)). Makinson? has generalized it to important catego- 
ries of nonmonotonic logics that include classical truth-functional 
consequence. 

Several weakenings of (R) have been investigated by Rott, 
Gärdenfors, and others.'° The only weakening that seems to resist 
(reconstruction of) the impossibility theorem is a version developed 
by Sten Lindström and Wlodzimierz Rabinowicz,"! in which revision 
(*) is not applied to the belief set itself but to maximal consistent 
extensions of the belief set. 

The validity of (0) has been questioned by Levi,!® who maintains 
that conditional statements are not “objects of belief which bear 
truth values and are subject to appraisal with respect to epistemic 
possibility and probability” (shd., pp. 49-50). Instead, conditionals 
are “appraisals of serious or epistemic possibility relative to trans- 
formations of the current corpus of knowledge or belief set” (tbid.). 
Therefore, conditional sentences should not be included in belief 
sets. (See below, section III.) 

Rott has argued against (E), i.e., against the assumption that the 
set of belief sets is closed under logical expansion. In his view 
“[e]xpansions are not the right method to ‘add’ new sentences if the 
underlying language contains conditionals which are interpreted by 
the Ramsey test.”!? 

As will be seen in the following section, I agree with Rott that the 
Ramsey test can be saved, and that (E) is the culprit. (More precisely, 
it is the combination of (E), (S), and (0) which I hold to be unten- 
able). My arguments are not the same as Rott’s, however, and my 
formal proposals are different from his (and much more related to 
proposals by Levi).'* 

Il ESCAPING THE IMPOSSIBILITY THEOREM 
I shall leave aside the—certainly very important—metaphysical is- 
sue whether or not a conditional sentence can be true or false in the 
same sense as an indicative sentence. For my present purposes it will 


° “The Gardenfors Impossibility Theorem in Non-Monotonic Contexts,” Stu- 
Logica, xLIXx (1990): 1-6. 

10 Rott, “Ifs, Though and Because,” Erkenninis, xxv (1986): 345-70; 
Gardenfors, “Variations on the Ramsey Test: More Triviality Results,” Studia 
Logica, XLVI (1987): 319-25, and Knowledge in Flux, pp. 160-2. 

“Belief Revision, Epistemic Conditionals and the Ramsey Test,” forthcoming 
in Nous. 

12 “Tteration of Conditionals and the Ramsey Test,” Synthese, LXXVI (1988): 
49-81. 

13 Rott, “Conditionals and Theory Change,” p. 100. 

4 Levi, “Iteration of Conditionals.” 
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be sufficient to make the weak, and likely uncontroversial, assump- 
tion that belief states can support certain conditional sentences. 

In this section, no assumptions will be made about the structure 
of belief states. In particular, they may or may not be sets of sen- 
tences. From a formal point of view, this means that there is a sup- 
port function s such that for any belief state K, s(#) is the set of 
sentences (of all categories) that are supported by H. s(#) will in- 
clude conditional sentences. Depending on the underlying lan- 
guage, it may also include sentences that contain operators of neces- 
sity, possibility, knowledge, belief, etc. 

Let L, be the indicative part of the language, i.e., the part that 
consists only of sentences of truth-functional or predicate logic. Fur- 
thermore, let L, be the subset of the language that contains, in 
addition to the symbols of Lo, only the symbol > for conditionals. 
Then L, N (F6 is the set of indicative sentences that are true in, or 
supported by, #. (The two locutions are synonymous for indica- 
tives.) Lı N (K) contains, in addition, those sentences containing > 
which are supported by #. (According to some metaphysical views, 
these sentences, too, are true in K.) Lo N s(#/) will be called 
the indicative belief set, and L,  s(H) the >-extended belief set 
(conditionals-extended belief set) that is associated with the belief 
state K. 

In most studies in the AGM tradition, belief states have been 
identified with sets of sentences that need not contain modal or 
conditional sentences. Belief states may then be identical to indica- 
tive belief sets, i.e., H = Lo N (K). In discourse on the Ramsey test, 
it has—with the notable exception of Levi’s writings—commonly 
been assumed that belief states are identical to >-extended belief sets, 
i.e., H = L, N s(H). It has not always been sufficiently understood 
that >-extended belief sets differ in their logic, in essential respects, 
from indicative belief sets. Properties of the latter that are so self- 
evident that they are not even explicitly discussed may very well not 
hold of the former. Important examples of this are that a given 
indicative belief set typically has several proper subsets that are in- 
dicative belief sets, and that is also has several proper supersets that 
are indicative belief sets. In other words, for a typical belief 
state # there are (1) several consistent belief states K’ such that 
Lo N SK’) C Lo N SF), and (2) several consistent belief states K” 
such that Ly N (5O C Ly N s(K”). But the same does not hold 
for >-extended belief sets, i.e., it does not hold if L is re- 
placed by L, in the formulas. 

To see why this is so, let us first consider the task of finding, for a 
given >-extended belief set, a proper subset that is also an >- 
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extended belief set. In other words, for a given belief state K we 
want to find a belief state K’ such that £, 9 s(K’) C Li N KH). 
The corresponding task for indicative belief sets (i.e., to find 
some #' such that Lo N s(K') C Ly N SFO) can be solved by per- 
forming any nonvacuous contraction on K! (Unless, of course, 
La N s(K) = Cn(Ø).) This method does not in general work for >- 
extended belief sets, however, since a nonvacuous contraction typi- 
cally provides support for conditional sentences that were not sup- 
ported by the original belief state. If I contract my belief that John is 
mentally retarded, then I gain support for the conditional sentence 
‘If John has lived 30 years in London, then John understands the 
English language’. If I contract my belief that my wife has a head- 
ache, then I gain support for the sentence ‘If it stops raining, then 
my wife would like to join me for an afternoon walk’, etc. 

Next let us consider the task of finding an >-extended belief set 
that is a proper superset of a given >-extended belief set. In other 
words, for a given belief state K we are searching for a belief 
state H” such that £, N KKH) C Lı N s(#"). The corresponding 
task for indicative belief sets (i.e., to find some K” such that 
Lo N SH) C Lo N s(#")) can be solved by expanding K by any 
indicative sentence p such that —p € Le N s(#). (Unless, of course, 
s(K) is negation complete.) This method does not work for >- 
extended belief sets, however, since the addition of an indicative 
sentence that is compatible with all previously supported indicative 
sentences typically withdraws the support of conditional sentences 
that were previously supported. As an example of this, suppose that 
I previously knew nothing about John’s profession. Furthermore, 
suppose that I acquire the belief that John is a driver by profession. 
No indicative sentence represents a belief that is lost by this,’® but 
support will be lost for various conditional sentences, such as ‘If 
John goes home from work by taxi every day, then he is a rich man’. 

Thus, neither of the two obvious ways to obtain indicative belief 
sets K, and Ky such that K, C Kg, namely, contraction and expan- 
sion, seems to work in general for >-extended belief sets. (As has 
been indicated by Levi and Rott, similarly formed -extended belief 

15 I use ‘contraction’ ın the general sense of an o ion on a belief state which 
retracts a certain sentence (or set of sentences) the set of supported sen- 
tences. A contraction ws nonvacuous if and only if the sentence to be retracted 
was supported by the original belief state. 

19 t of course, indicative sentences that refer to my belief state, such as ‘I 
know no about John’s profession’. The inclusion of such sentences into Oe 
cally closed representations rise to serious problems. See my ‘ 
Paradox of the Believer,” Phi xxI (1991): 25—30. For our present pur- 
poses, these problems will be ded by the assumption that neither Le nor Lı 
contains doxastic or epistemic operators. 
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sets, where © denotes possibility, have essentially the same property. 
The nonvacuous contraction of q typically provides support for 
© — q, and the nonvacuous expansion by q withdraws support for 
oa”) 

In ae to see how the differences between indicative and >- 
extended belief sets will affect the proof of the impossibility 
theorem, let us consider the following version of the central part of 
Gardenfors’s proof. Following Gärdenfors, it will be assumed in the 
following passage that belief states are identical to the >-extended 
belief sets that they support. 

Let K be a >-extended belief set, let p and q be sentences such that 

K + ~q, K + (p= q), and K + (~p & —9) are all >- extended belief sets 

and that —ġ is an element only of the third of these. Then K + —g and 

K + (p = q) are both proper subsets of K + (~p & —q). By (M), that 

follows from (R), we may then conclude that (K + —q)*p and 

(K + (p = q))*p are both proper subsets of (K + (~p & —q))*p. Thus by 

(P) both ~g and p ™ q are elements of K + (~p & —q)*p. By (1), p is also 

an element of K + (~p & —q)*p, and by (2) K + (~p & —q)*p is consis- 

tent. By (L), this is a contradiction. 

By way of example, let us consider beliefs about a vote in parlia- 
ment. Let p denote that a certain party is going to recommend its 
M.P.s to vote in favor of a certain bill, and let q denote that my M.P., 
Ms. Quinton, will vote in favor of that same bill. (For simplicity of 
logic, we shall disregard abstentions.) Let K be an >-extended belief 
set that corresponds to (and, on Gardenfors’s assumptions, is identi- 
cal to) a belief state in which I know nothing about how the party or 
Ms. Quinton are going to vote. 

In the proof (in the above version) it is assumed that, when I 
adopt (only) the belief ‘Ms. Quinton is going to vote against the bill’, 
then my new >-extended belief set will be K + —q. This, however, is 
not realistic. When learning that Ms. Quinton is going to vote 
against the bill, I also acquire support for the sentence ‘If the party 
recommends voting for the bill, then Ms. Quinton votes contrary to 
the party’s recommendation’, i.e., p > (q = —p). This sentence is 
clearly not an element of K + ~q. 

Similarly, when I adopt (only) the belief ‘Ms. Quinton is going to 
vote for the bill if and only if the party recommends its M.P.s to do 
so’, then my >-extended belief set is assumed in the proof to be 
changed into K + (p = q). In this case, however, I also acquire 
support for the sentence ‘If the party recommends voting for the 
bill, then Ms. Quinton votes in favor of the bill’, i.e., p > q. 


7 Levi, “Iteration of Conditionals’; Rott, ‘Conditionals and Theory Change,” 
p. 109. 
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$ > (q = —>) and p > q are clearly inconsistent. Therefore, the 
>-extended belief set that is obtained from K by acceptance of —q 
and the >-extended belief set that is obtained from K by acceptance 
of p = q cannot both be subsets of one and the same consistent 
>-extended belief set (such as K + (~p & —g)). Thus, the use of 
expansion in the proof is at variance with what seems to be a reason- 
able account of the dynamics of >-extended belief sets. 

Admittedly, expansion and the (E) condition are not essential for 
the proof. In order to reconstruct the proof it is sufficient to con- 
struct, in some way or other, three >-extended belief sets K,, Ky 
and K such that K, C K, K, C K, K, ¢ K, and K, ¢ K,. The most 
obvious way to do this would be to perform two different non- 
vacuous contractions of K. As was shown above, however, although 
contraction easily provides us with an indicative belief set that is a 
proper subset of a given indicative belief set, it does not in general 
provide us with a >-extended belief set that is a subset of a given 
>-extended belief set. Furthermore, a more general argument can 
be given which suggests that there is no way in which three >- 
extended belief sets K,, K}, and K with the desired properties can 
be obtained. The argument takes the form of a theorem that de- 
pends on the following four properties of belief states: 


(LC) Let A be a set of sentences and p a sentence. If A c s(#f) and 
p E Cn(A), then p E $K) (logical closure). 

(NC) Let K be a consistent belief state and p a sentence such that 
p E SK). Then —p € s(#) (negation consistency). 

(CC) Let K be a consistent belief state and p a consistent expression. 
If p > qı E SFK) and p > qa E SK), then q, & gz is consistent 
(conditional consistency). 

(CP) For all p and q, if p € s(H), q £ (90, and —y E (F0), then 
q > (p & q) E SH) (conditional preservation). 


Logical closure (LC) is a direct consequence of the meaning of ‘sup- 
port’ in this context. It should be observed that closure is only pos- 
tulated for the set of sentences that are supported by K. The belief 
state K may nevertheless be a set of sentences that is not closed 
under consequence, or it need not be a set of sentences at all. 

Negation consistency (NC) and conditional consistency (CC) may 
be seen less as postulates than as parts of a definition of what it 
means for belief states to be consistent. If the background logic is 
supraclassical, then (NC) and (CC) are necessary conditions for a 
belief state to be consistent. For my present purposes I do not need 
to know whether or not the conjunction of the two is a sufficient 
condition for consistency. 
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Conditional preservation (CP) is the least obvious of the four pos- 
tulates. It can be argued for with more or less covert reference to 
the Ramsey test, but that would be too close to begging the ques- 
tion. It remains, therefore, to appeal directly to prephilosophical 
intuitions about conditional inference. Let p and q be arbitrary sen- 
tences that may be either connected or unconnected in terms of 
subject matter. It can easily be verified that (CP) conforms with 
intuition in various such cases. As one example among many, sup- 
pose that you believe that Stockholm is the capital of Sweden (p) but 
have no opinion of whether or not Stockholm is the largest city in 
Sweden (q). Then you believe that, if Stockholm is the largest city in 
Sweden, then it is both the capital of Sweden and that country’s 
largest city. (As the reader can straightforwardly verify, the theorem 
also holds if q > (p & q) is replaced by q > p in the definition of 
conditional preservation. q > (p & q) has been chosen since it is 
more conformant with linguistic convention.) 


THEOREM 1. Let K, Kı, and K, be consistent belief states that satisfy 
(LO), (NC), (CC), and (CP). Then, if L, N H) Ss L, N SH) and 
Li O KHa) S Li N AK), then either L, N AK) a Li N SKa) or 
L,NKKH,) E L N K). 


For proofs of theorems and observations the reader is referred to 
the appendix below. 

In summary, the impossibility theorem depends on the un- 
warranted generalization of closure under expansion (E) from indic- 
ative belief sets, for which it holds, to >-extended belief sets, for 
which it does not hold. One way to reconstruct the theorem would 
be to use contraction in lieu of expansion to construct the three 
>-extended belief sets that are essential for the proof. That too, 
however, would require an unsustainable application to >-extended 
belief sets of principles that are valid for indicative belief sets. In- 
deed, if four reasonable postulates for the dynamics of belief states, 
namely, (LC), (NC), (CC), and (CP), are accepted, then it follows by 
theorem 1 that there is no way to reconstruct the proof of the 
impossibility theorem. 

Having seen that the Ramsey test can be defended against the 
impossibility theorem, I shall now tum to the construction of a 
formal account of epistemic conditionals which accords with the 
Ramsey test. 

II. A FORMAL ACCOUNT OF EPISTEMIC CONDITIONALS 
In this and the following section, I shall make the simplifying as- 
sumption that the language contains no other symbol than > in 


530 THE JOURNAL OF PHILOSOPHY 


addition to those of truth-functional and predicate logic, i.e., that 
L, comprises the whole of the language. 

According to the standard version of the Ramsey test, the set of 
conditional sentences that are supported by a belief state % can be 
determined from (1) the set of indicative sentences that K supports, 
and (2) the behavior of # under revision. Since belief states are then 
completely determined by these two factors, we may, without loss of 
generality, let each belief state be represented by a pair (K, DY of 
one indicative and one dynamic component. K, the indicative com- 
ponent, is a set of sentences such that Cn(K) = La N (F0. K may, or 
may not, be closed under logical consequence so that Cn(K) = K. D, 
the dynamic component, is a formal structure of some kind that 
determines, for each sentence p, the outcome of the revision of 
(K, D) by p. 

Due to our restriction of the language, s((K, DY) = L, for all 
belief states (K, D). The support function s should have the follow- 
ing three properties: 


G) If p E Lo, then p E s((K, DY) iff p E Cn(k). 
Gi) s((K, DY) = Cn(s((K, DY) 
(iii) p >q € (K, DY) holds if and only if p > q 15 supported by the 
Ramsey test, as applied to (K, DY. 


According to (i), truth conditions (support conditions) for indicative 
sentences are independent of the dynamic component of the belief 
state. According to (ii), the support function is closed under logical 
consequence. (ii) implies, for instance, that if p > q © 5((K, DY) and 
r> tE sK, DY), then (p>) &(r>d)E s(CK, DY). (iii) signals 
compliance with the Ramsey test. Before more can be said about the 
formal interpretation of (iii), we need to have a closer look at the 
structure of the dynamic component D of a belief state (K, DY. 
The most obvious construction is to let D be either a revision 
operator for K or some formal device (such as a selection function 
or a relation of epistemic entrenchment) that uniquely determines a 
revision operator for K. In an important sense, however, revision 
operators such as those of the AGM formalism are insufficient for 
this purpose. When revising a belief state (K, D) by a sentence p, we 
want the result to be a new belief state with the same structure, 
(K’, DY. Given a belief state (K, *), in which K is an indicative belief 
set and * an AGM operator of revision for K, we obtain for each 
indicative sentence p a new indicative belief set K’ = K*p. But we do 
not obtain, directly from K and *, a new revision operator * for K’. 
Since the dynamic component of the new belief state is not deter- 
mined, neither is the set of conditional sentences that it supports. 
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There seem to be two major ways out of this dilemma. One of 
these is to find some formal structure for the dynamic component of 
(K, DY which does not have this limitation. In other words, for each 
indicative sentence p, D should determine not only the indicative 
belief set K*p but also a new dynamic component D that combines 
with K*p to form a new belief state of the same form, (K*p, D'Y. It is 
no easy matter, however, to finda plausible formal structure for the 
dynamic component which has this property.’® 

The other, more simplistic, solution is to assume that the dynamic 
component is determined by the indicative component, i.e., that for 
each K there is only one D with which it can be combined. This 
seems to have been commonly taken as a tacit assumption in the 
belief-dynamics literature. In an earlier paper,’® I brought it up to 
the surface, giving it the following mathematical formulation: the 
background apparatus that is common to all belief states contains, 
along with the language and the logic, a function f, the superselector 
(so called since it selects selectors). To each set K that can form the 
indicative part of a belief state, fassigns a dynamic component f (K). 
The latter may be either a revision operator for K or some selection 
mechanism (such as a selection function) that in its turn determines 
a revision operator for K. In the former case, given a belief state 
(K, *), revision by p results in the new belief state (K*p, f(K*p)) 

This construction clearly solves our dilemma from a formal point 
of view, and it provides us with an account of belief revision that 
treats first-order conditionals in the way prescribed by the Ramsey 
test. But the construction has a problem from the point of view of its 
intuitive interpretation: if it is assumed that the indicative compo- 
nents of belief states are closed under logical consequence, then for 
each indicative belief set there is, with this construction, only one 
revision operator with which it can be combined. 

To see why this is an implausible feature of our construction, let p 
(as above) denote that the party recommends voting in favor ofa 
certain bill and q that Ms. Quinton, M.P., votes in favor of it. Fur- 
thermore, let K = Cn({p}), and let * be a revision operator for K. 
Then, with weak and reasonable assumptions on *, K* q= K* =q), 
from which follows that f(K*q) = f (K* (p = q)). It follows from this 
that the result of revising (K, *) first by q and then by —p must be 
the same as that of revising (K, *) first by p = q and then by “Pp. 
From an intuitive point of view, however, the result of the first chain 


18 See my “In Defense of Base Contracuon,” in , XCI (1992): 239-45. 
19 “A Dyadic resentation of Belief,” in ors, ed., Belef Revision 
(New York: Cambridge, 1992), pp 89-121 
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of operations should be a belief state that supports g, and that of the 
second chain of operations should be a belief state that supports ~g. 

One would wish a formal system of belief dynamics to allow one 
and the same indicative belief set to take on different dynamic prop- 
erties, and thus also to be combinable with different sets of condi- 
tional sentences. There is, perhaps surprisingly, a way to achieve this 
without giving up the formal simplicity of the superselector ap- 
proach outlined above. The trick is to let K, the indicative part of 
the belief state, be a belief base that is not closed under logical 
consequence. One and the same indicative belief set can then be 
represented by different belief bases, each of which has the belief 
set as its logical closure. The superselector will assign different revi- 
sion operators to different belief bases with the same indicative be- 
lief set. In this way, one and the same indicative belief set can appear 
in belief states that are different in terms of their dynamic proper- 
ties and in terms of the conditional sentences that they support. 

To see how this works in the above example, let the indicative 
belief set Cn({p}) of the original belief state be represented by the 
belief base {p}. With reasonable assumptions about (minimal) revi- 
sions of belief bases, the result of revising the original belief state by 
q should be a belief state with the belief base {, q}, and further 
revision by —p should result in a belief state with the belief base 
{—p, q}. On the other hand, revision by p ™ q should yield a belief 
state with the belief base {p, p = q}, and further revision by —p 
should yield the belief base {—p, p = q}. Thus, q is supported in the 
first case and —q in the second, in conformance with what was called 
for in the informal example. 

The congruity in this case between intuitive inference and opera- 
tions on belief bases is no coincidence. As I have indicated else- 
where,” the belief base approach can account for a surprising range 
of cases in which one and the same indicative belief set appears in 
belief states with different dynamic properties. (The usefulness of 
the belief base approach for the analysis of conditional sentences 
was pointed out by M. Morreau.*") 

In summary, the application of superselectors to belief sets allows 
for a formal account of belief dynamics that 


(1) conforms with the Ramsey test, and 
(2) provides, as the result of a revision, not only a new indicative 
belief set but also a new >-extended belief set. 


P “New sd erat For Theory Change,” Theona, LV (1989). 114—132; and “In 
Defense of Base Contraction.” 

1 “Epistemic Semantics for Counterfactuals,"”” Journal of Philosophical Logic, 
xxi (1992): 88-62. 
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If we use belief bases instead of belief sets, then the formal system 
also 


(3) allows one and the same indicative belief set to be the indicative 
part of different >-extended belief sets. 


I do not wish to claim that there is no other way to achieve these 
properties, but this is a simple and plausible way that is well worth 
exploring. 

Since, in this approach, the second (dynamic) component of a 
belief state is a function of the first (indicative) component, namely, 
the belief base, we may omit reference to the second component. In 
other words, each belief state can be identified with its belief base. 
This will be done in the rest of this paper. 

We now have enough background for the formal statement of a 
belief-revision system that conforms with the Ramsey test for (first- 
order) conditionals. Revision operators will be based on selection 
functions in the way that is standard in the AGM tradition.” 


DEFINITION 1. Let K S Lo and q E Lo. K 1 q, the remasnder of K by 
q, is the set of inclusion-maximal subsets of K that do not have q as a 
logical consequence. 

A selection function for K is a function y such that for all 
q E Ly: y(K 1 q) is a nonempty subset of K 1 q, unless the latter is 
empty, in which case y(K L q) = {K}. 

A selection function y for K is relational if and only if there is a 
relation € on the set of subsets of K such that for all nonempty K 1 p: 


y(K Lp) = {K EKLpIK 4K for all KEK 1 p}. 


DEFINITION 2. * is an operator of partial mest revision for a subset K 
of L, if and only if there is a selection function y for K such that for all 
p € Le: K*p = (MK 1 —p)) U {p} 
DEFINITION 3. A belief-revision system (in a given language and rela- 
tive to a given logic) is a pair (B, f ), where B is a set of belief bases and 
f is a function that to each K E 8 assigns a revision operator *, such 
that for all indicative sentences p of the language, K*p © BP 

A belief-revision system is orderly if and only if for each K € B, fK) 
is a partial meet revision operator that is based on a selection function 
that is relational by some relation $x that is reflexive, transitive, and 
complete, and also has the maximizing property, namely, that for its 
strict counterpart <, if A C B then A<B. 


Sipe “Reversing the Levi Idenuty,” forthcoming in Journal of Philosophs- 
gic. 

8 I owe the basic idea for this to a suggestion by André Fuhrmann m corre- 
spondence. 
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If (B, f is orderly, then the (global) revision operator * such that 
for each K E $, K*p = K*gp (with *, = f(K)) is an orderly revision 
operator. 


The support condition (truth condition) for indicative sentences 
has already been mentioned: if p € Lo, then p € s(K) if and only if 
p E Cn(K). For conditional sentences, the following support condi- 
tion will be used: 

(R1) Let p and q be indicative sentences and K a set of indicative 
sentences. Then p > q E (K) iff q E K*p, where * is the unique 
revision operator that is assigned to K in the belief-revision 
model. 


This account of conditional sentences is “parasitic” in Levi’s termi- 
nology, i.e., the set of supported conditional sentences is (in a given 
belief-revision system) completely determined by a set of supported 
indicative sentences (the belief base).24 The above argument may be 
seen as a demonstration that a parasitic account of conditionals can 
be motivated independently of Levi’s metaphysical standpoint on 
which sentences can or cannot be true or false. 
IV. ITERATED CONDITIONALS 

An important limitation of this type of parasitic account was pointed 
out by Gardenfors when commenting on Levi’s analysis of condi- 
tionals: “The main problem with this position is that it does not 
account for iterated conditionals. What kind of belief would a propo- 
sition like the following be on this view? ‘If this vase breaks if 
dropped on the floor, then it breaks if thrown against the wall’. 
In an answer to Gardenfors, Levi questioned that “bona fide exam- 
ples of such iterated conditionals are used in serious discourse 
or inquiry,”*° but he nevertheless offered acceptability criteria 
for some types of iterated conditionals. He did not, however, pro- 
vide a fully formalized criterion for conditionals of the type 
(p > q) > (r > t), and he “cheerfully plead[s] guilty” to the charge 
that “this account of iterated conditionals is not fully systematic in 
the sense that I have failed to supply a single acceptability condition 
applicable to both noniterated and iterated conditionals” (ibid., P- 
78). According to Levi, both p > q and r > ¢ in this example are 
disposition sentences of a type that should not be interpreted as 
conditionals. 


M Cf. Levi, “Iteration of Conditionals,” p. 66. 
3 Gärdenfors, “Belief Revisions,” p 90 
3 Levi, “Iteration of Conditionals,” p- 73. 


IN DEFENSE OF THE RAMSEY TEST 535 


In Levi’s approach, conditional sentences are parasitic on (logi- 
cally closed) indicative belief sets (corpora), rather than on belief 
bases. Gardenfors’s argument, however, is equally valid against the 
approach that was introduced in section III, with its parasitism on 
belief bases. In this section, I intend to show that there is a reason- 
able interpretation of the Ramsey test under which a fully systematic 
account can be obtained of iterated conditionals of arbitrary com- 
plexity. 

The following is the most general version of the new interpreta- 
tion: 


(R2) For any belief state K and any two sentences p, q E Ly, 
p > q © AK) iff q E s(K’) for any K’ that is the out- 
come of a smallest possible change of # to make it support p. 


The major difference between (R1) and (R2) is that, whereas the 
former stipulates a specified method by which to incorporate p into 
the belief state, the latter only postulates that the outcome of the 
change (however it is brought about) should be that p is supported 
and that, given this, the original belief state has been changed as 
little as possible. One argument in favor of the more general ap- 
proach of (R2) is that, in some cases, the most natural way to “add” 
a sentence to one’s stock of supported sentences does not seem to 
be an operation of revision. In particular, as was observed by Levi, 
the process of acquiring support for the sentence Op (‘it is possible 
that p’) seems to coincide with the process of contracting the sen- 
tence —p (ibid., pp. 70—1). 

In what follows, (R2) will be applied to belief states of the type 
that we ended up with in the previous section; thus, belief states are 
represented by belief bases. In order to give precision to the notion 
of a smallest possible change (or minimal change), I shall make use 
of three-termed similarity relations of the type proposed by David 
Lewis:?” 


S(X, Y, Z) “X is at least as similar to Y as is Z to Y” 
The strict counterpart will here be denoted as follows: 
SX, Y, Z) = SX, Y, Z) & =S, Y, X) 


Lewis proposed that the similarity relation should be a (weak) order- 
ing and that each world should resemble itself more than any other 


Y! Counterfaduals (Cambridge: Blackwell, 1973) 
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world resembles it (ibid., pp. 48-9). Thus, S should satisfy the follow- 
ing conditions: . 


(S1) S(X, W, Y) V S(Y, W, X) completeness 
(S2) S(X, W, X) reflexivity 
(S3) S(X, W, Y) & S(Y, W, Z) > S(X, W, Z) transitivity 
(S4) S(X, W, W) > X= W centering 


For each world W, axioms $1—S4 completely characterize those 
sphere systems around W in which all worlds belong to some sphere 
(ibid., p. 49). These axioms can also be derived from plausible con- 
ditions on a more general format for similarity, namely, a four- 
termed similarity relation ‘X is at least as similar to Y as Z is similar 
to W’.8 

In his seminal work on similarity of sets, A. Tversky? proposed 
that the similarity between two sets A and B should be measured bya 
real-valued function F(A N B, A\B, B\A) that increases with its first 
argument and decreases with its second and last arguments. The 
following two conditions on a similarity relation follow his ideas so 
closely that I venture to call them the Tuersky properties: 


(T1) If KC KN K and K'\K = K’\K, then S(K”, K, K’). 
(T2) If K N K = KNK and K"\K C KAKE, then S(k’, K, K’). 


The Tversky properties make (S4) superfluous: 


OBSERVATION 1. Ifa similarity relation S satisfies (S2), (S3), (T1), 
and (T2), then it satisfies (S4). 


We now have the tools necessary to give precision to the phrase 
‘smallest possible change’ of (R2) in terms of similarity relations: 


DEFINITION 4. K is a p-set iff either p E K or p E€ s(K)/Lo. 
K' is a (consistent) K-closest p-set by a similarity relation S iff K’ 
is a consistent pset and there is no consistent p-set K” such that 


S(K, K, K’). 


For simplicity, the word ‘consistent’ will be deleted in the phrase 
‘consistent K-closest p-set’. 


= Cf T. Williamson, ‘First-Order Logics for Comparative Similarity,” Notre 
meet iey of Formal tope XXIX (1988): 457-81; my “Similarity Semantics 
and i Changes of ef,” forthcoming in Erkenntnis. 

® “Features of Simularity,” Psychological Review, LXXXIV (1977): 327-52. 
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(RY) For any belief base K and any two sentences p, q © Li, 
p > q © SK) iff q E s(K’) for all K that are K-closest p-sets 
by a given similarity relation S. 


This will be called the similarity-based Ramsey test, in contrast to the 
(conventional) version (R1), which will be called a reviston-based 
Ramsey test. The revision-based version has the important advantage 
of being simple and direct for first-order conditionals. The similar- 
ity-based version has the advantage of being defined for iterated 
conditionals of arbitrary complexity. In order to combine both ad- 
vantages, one would need a similarity-based Ramsey test that coin- 
cides for first-order conditionals with a revision-based Ramsey test. 
In other words, the following type of relationship should hold be- 
tween a similarity relation and a revision operator: 


DEFINITION 5. A similarity relation S corresponds to an operator * if 
and only if for each consistent set K of indicative sentences and each 
consistent indicative sentence p, K*p is the intersection of the set of 


K-closest p-sets by p. 


It follows from the definition that for each similarity relation S there 
is exactly one operator * such that S corresponds to *. The following 
theorem shows that we can indeed combine the advantages of the 
two versions of the Ramsey test. In other words, the parasitic ac- 
count can be extended to cover iterated conditionals of arbitrary 
complexity. 


THEOREM 2. Let * be an orderly revision operator. Then there is a 
similarity relation S that corresponds to * and that satisfies (S1), (S2), 
(S3), (S4), (T1), and (T2). 


In general, there are many ways in which an orderly revision opera- 
tor can be “extended” to a similarity relation to which it corre- 
sponds. In other words, revision by conditionals requires more in- 
formation about entrenchment or epistemic value than what is re- 
quired for revision by indicatives. 

The following are some of the logical properties of the condi- 
tional connective in this framework: 


OBSERVATION 2. Let K be a consistent subset of Lo, and let > be 
evaluated, in the manner of (R2’), according to a similarity relation that 
satisfies (S1), (S2), (S3), (S4), (T1), and (T2). Then 


(1) p > p ESK) 
(2) p >q EK) iff p > p&q ESK) 
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(3) If p > qa © $K) for all 1 < k < n and r E€ Cn({qy, ... qa}), then 
p>rEeEsxK). 


The following relationships do not hold in general, however: 


(nl) If ($ V q) > r E€ s(K) then p > r E s(K) (SDA, simplification of 
disjunctive antecedents). 

(n2) If p > r E s{K) then p & q > r E sK) (strengthening the ante- 
cedent). 

(n3) If p > q E s(K) then ~q > —p E s (K). 

(n4) If p > q € s(K) and q > r E s(K), then p > r E s(K) (transitivity). 


SVEN OVE HANSSON 
Uppsala University 


APPENDIX: PROOFS 


Before the proofs of the theorems and observations, two lemmas will be 
given that are needed for the proof. 


LEMMA 1. If a similarity relation S satisfies (S3), then it satisfies the 
following properties: 


(1) SX, W, Y) & SY, W, Z) > SX, W, Z) 
(2) SX, W, Y) & SY, W, Z) > SX, W, Z) 


PROOF OF LEMMA I. For part 1, suppose that S(X, W, Y) and 
S(Y, W, Z). It follows by (S3) that S(X, W, Z). If S(Z, W, X), then it fol- 
lows by (S3) from S(Y, W, Z) and S(Z, W, X) that S(Y, W, X), contrary to 
S(X, W, Y). Thus —S(Z, W, X). It follows from S(X, W, Z) and —S(Z, W, X 
that S(X, W, Z). 

For part 2, suppose that S(X, W, Y) and S(Y, W, Z). It follows by (S3) that 
S(X, W, Z). If SZ, W, X), then it follows by (S3) from S(Z, W, X) and 
S(X, W, Y) that SZ, W, Y), contrary to S(Y, W, Z). Thus —S(Z, W, X). It 
follows from S(X, W, Z) and —S(Z, W, X) that S(X, W, Z). 


LEMMA 2. Let p E Lo. If K' is a K-closest p-set by a similanty relation 
S that satisfies quasi transivity (S(X, W, Y) & S(Y, W, Z) > S(X, W, Z)), 
(T1) and (T2), then K' = M U {p} for some M E K | —p. 


PROOF OF LEMMA 2. Let K’ be a K-closest p-set. First sup- 
pose that K' N K ¢ K 1 —p. Then, since —p € Cn(K’), there is some 
M EK 1 —p such that K NKC MEK 1 —>. Let KY = MU {p}. Then 
K AKCE NK. It follows by (T1) that SK", K, (K N K) U {p}). If 
KNK < {p}, then S(K’, K, K’) follows directly. If K'\K ¢ {p}, then (T2) 
yields S(K’ N K) U {p}, K, K’), and quasi transitivity yields S(K’, K, K'). It 
follows from S(K", K, K') that K' cannot be a K-closest p-set, contrary 
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to the conditions. We may conclude from this contradiction that 
KNKEKL-». 

Next, suppose that K’\K ¢ {p}. Let KY = (K N K) U {p}. We then have 
KOK = RK" K and KKC K\K. It follows by (T2) that S(K’, K, K), 
so that K’ cannot be a K-closest p-set,-contrary to the conditions. Thus 
K'\K s {p}, from which the lemma follows directly. 

PROOF OF THEOREM 1. Suppose not. Then there are x, and x, in L, 
such that x, E s(#;)\s(H,) and xg E SKANK). 

We are first going to show that (~x; V —x3) € (K). Suppose not. Then it 
follows by x, E s(#,) and the logical closure (LC) of s(t,) that ~x, E (F). 
Thus —x, E (K), contrary to xp E SKa) S (K) and the negation consis- 
tency (NC) of s(#). We may conclude that (~x, V ~x) € (H). 
Furthermore, from x, ¢ s(#,) follows, by the logical closure (LC) of 
S(H,), that —(-x) V —x_) € s(#,). By conditional preservation (CP) fol 
lows that (~x; VY —x_) > (x; & (Cx; Y —x2))) E (H), i.e., that 
(Cxi V Tx) > (x; & xq) E (K). 

By a symmetrical argument follows ((—x, V xq) > (xq & —x,)) E (Hy). 
By s(#,) = (F) and SKa) & (F) follows that both 


(Cxi V aq) > (xı & xq) and ((~x, V xg) > (£3 & —™,)) 


are elements of K. 

Suppose that —x, V —x, is inconsistent. Then x is a logical truth so that, 
by logical closure (LC), x € s(#,). Since this is not the case, we may con- 
clude that ~x; V —x, is consistent. Thus conditional consistency (CC) is 
violated. This contradiction completes the proof. 

PROOF OF THE THEOREM 2. For each set K, let <r be the maximiz- 
ing ordering on the subsets of K on which revisions of K by * are based. Let 
u be any real-valued function u such that u(@) = 0, u(K) < 1 and for al M, 
N G K: u(M) < u(N) iff M <x N. For all sets A, let |A| denote the number of 
elements of A. The following construction will be used: 


SM, K, N) iff |M\K| — u(M N K) < |N\K| — uN N K) 


It follows straightforwardly that (S1), (S2), (S3), (S4), (T1), and (T2) 
are satisfied. We are going to show that S corresponds to *. For any 
indicative sentence p, let Z(K, p) = {M U {p}|M © K L —p and 
S(M U {p}, K, NU {p}) for all N E K L —p.}. We are going to show (I) that 
Z(K, p) is the set of K-closest p-sets by S, and (II) that K*p = NZK, p). 
Part I. Lf K’ is a K-closest p-set by S, then it follows from lemmas 1 and 2 
that K’ € Z(K, p). It remains to be shown that if K' E Z(K, p), then K’ is a 
K-closest p-set by S. Suppose to the contrary that K’ € Z(K, p) and that K is 
not a K-closest pset by S. Then there is some consistent set K” such that 
p EK and $K, K, K’), ie., |K’\K| — u(K” N K) < |K\K| — uK Nn K). 
Let K, be any set such that K N K s K, E€ K L >b, and let 
K, = K, U {p}. Then K,\K s KNK, so that |K,\K| < |A’\K|. 
By the reflexivity and maximizing property of <, it follows from 
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Kn K < K, =K 9 K that K7 9 K 4 KNK, stut 
u(K N K) < u(Ky N K). It follows that 


|Ka\K| — w(K, N K) < |K"\K| — u(K" N K), 


so that |Ky\K| — uk, N K) < |K\K| = uk N K), i.e., S(Kg, K, K), 
contrary to K E Z(K, p). It follows from this contradiction that Z(K, p) is the 
set of K-closest psets by S. 

Part II. By the definitions, 


K*p = (NYK L —p) U {p} 
= ({MEK L —p|N 4 M for all NEK 1 —p} U {p}). 


From M, NEK L —>p it follows that p E€ M iff p E N, so that |(M U {p})\K| 
= IW U {p})\K]. Therefore, S(M U {p}), K, (N U {p})) holds iff u((N U 
{p}) N R) < u(MU {p}) NR). Since p is an element of both M and N iff it is 
an element of K, this is equivalent to u(N) < u(M), and thus to N 4, M. We 
may therefore substitute S(M U {p}), K, (N U {p})) for N <, M in the 
above formula for Kp, and obtain: K*p = (N{M E K L —p|S(MU {2}, K, N 
U {p}) for all NEK 1 —p} U {p}) = NZ(K, $). This concludes the proof of 
the theorem. 

PROOF OF OBSERVATION 1. Suppose that (S2), (S3), (T1), and (T2) 
are satisfied and that X * W. We are going to show that —S(X, W, W). 

From X * W it follows that either W\X * Ø or X\W + Ø. 

If W\X * Ø, then S(W, W, X N W) follows by (T1). If X c W then 
SA N W, W, X) follows by (S2), and otherwise it follows by (T2). It then 
follows by (S3) and lemma 1 that $(W, W, X). 

If X\W * Ø, then SX N W, W, X) follows by (T2). If Wc X, then 
S(W, W, X N W) follows by (S2), and otherwise it follows by (T1). It then 
follows by (S8) and lemma 1 that §(W, W, X). 

In both cases, $(W, W, X) yields —S(X, W, W). 

PROOF OF OBSERVATION 2. The proofs of (1)(3) are simple. 

For (n1), let K = {q, r, p > —r}, and let S be such that {p,q p > r} is 
the K-closest p-set by S. 

For (n2), let K = {p,r,qg> —r}, and let S be such that {p,p&qq> =r} 
is the K-closest p & q-set by S. 

For (n3), let K = {p, q}, and let S be such that {p, —q} is the K-closest 
—¢set by S. 

For (n4), let K = {—p, q, r, p > —r}, and let S be such that {p,q bp > -r} 
is the K-closest p-set by S. 
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ENVY AND INEQUALITY* 


nvy seems to include the desire to eliminate inequality: the 
E subject’s unequal (namely, inferior) position compared to 

the object s position. In consequence, people raise two dif- 
ferent claiins regarding the envy-inequality relation: (a) the basis for 
envy is a concern for equality (and thus a reason for condemning 
this concern); and (b) reducing inequality will reduce envy (thus a 
reason for praising the concern for equality). The two claims consti- 
tute what seems to be the moral paradox of envy, namely, the fact _ 
that envy is morally condemned despite the apparent presence of afi 
important moral component, namely, the desire to eliminate in- 
equality. I believe that both claims are erroneous. I shall argue that 
envy does not involve a moral egalitarian concern and that reduced 
inequality does not lead to less envy—on the contrary, in most cases 
it raises the intensity of envy. 

The major question discussed here concerns the dependence of 
the intensity of envy on the subject-object distance. Is envy more 
intense when the object’s superiority is minor or when it is signifi- 
cant? This question can also be formulated in sociological terms: 
Will the reduction of social and economic gaps in a certain society 
lead to a reduction in the intensity of envy? Dealing with these re- 
lated problems requires the examination of the nature of envy, and 
in particular whether the focus of concern in envy is the subject’s 
inferiority or the object’s undeserved good fortune. In light of the 
answer to this question, the major variables responsible for the in- 
tensity of envy are indicated and the type of correlation between 
envy and the subject-object inequality is described. In the final sec- 


* I am grateful to Yoram Amiel, David Heyd, Giora Hon, Ruth Lorand, Saul 
Smilansky, Daniel Statman, Michael Strauss, and Eddy Zemach for helpful discus- 
sons. 
0022-362X/92/8911/551—81 © 1992 The Journal of Philosophy, Inc 

551 


552 THE JOURNAL OF PHILOSOPHY 


tion this correlation is examined by briefly analyzing a society in 
which inequalities are significantly reduced. 
I 


Envy may be characterized as a negative attitude toward another 
person’s superiority and the desire to gain what this person pos- 
sesses. Two major issues appear to be prominent in explaining this 
attitude: (a) inferiority, and (b) desert. Taking each issue as a central 
concern of envy results in the formulation of two major approaches: 
(a) envy is concerned with the subject’s inferronty; and (b) envy is 
concerned with the object’s undeserved good fortune. The first ap- 
proach assumes the objects of envy to be entitled to their superior- 
ity; envy is seen as a nonmoral emotion. In the second approach, 
envy might be basically considered a justified, moral emotion since 
the objects are unworthy of their good situation. 

Adam Smith’s view is compatible with the first approach; he em- 
phasizes the difference in position: “Envy is that passion which views 
with malignant dislike the superiority of those who are really entitled 
to all the superiority they possess.” This characterization is in accor- 
dance with Aristotle’s distinction between envy and indignation (or 
resentment). Both envy and indignation involve pain caused by the 
sight of other people’s good fortune. But whereas indignation is 
concerned with undeserved good fortune, envy is concerned merely 
with others’ having something that we want. John Rawls takes a 
similar stand and distinguishes between envy that arises because of 
“the better situation of others catches our attention,” and resent- 
ment that addresses injustice and wrongful conduct. Only resent- 
ment is a moral emotion. Descartes seems to be close to the second 
approach and emphasizes the unwarranted position of the object: in 
envy “we judge the others unworthy of their good.” Similarly, A. 
Ortony and his colleagues regard undeservingness as the central 
concern of envy and classify envy as a specific example of resent- 
ment. General resentment involves a concern for ‘“‘the undeserving- 
ness of the other person enjoying the benefit.” Envy adds to this 
concern the subject’s desire to obtain the benefit. Accordingly, they 
argue that “‘one’s sense that the other does not deserve his good 
fortune” is of crucial importance in these emotions, and “‘it is 
fuelled by one’s own sense of privation and by a perception that the 
other is comparable to oneself.’””? 


Smith, The Theory of Moral Sentiments (Indianapolis: Liberty, 1790/1982), p 
244, Anstote, Rhetoric, II, 9, 10; Rawls, A Theory of Justice (New York: Oxford, 
1972), p 533, Descartes, The Passions of the Soul, art 62, m J. Cottingham, R 
Stoothoff, and D Murdoch, trans , The Philosophical Wrisngs of Descartes (New 
York Cambridge, 1984); Ortony, G. L Clore, and A Collin g The Cognitrve 
Structure of Emotions (New York: Cambridge, 1988), pp. 101— 
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The assumption that moral concern is central to envy is fraught 
with difficulties. To begin with, envy is often directed at people who 
are lucky, but being lucky is not immoral. Moreover, envy addresses 
a partial concern rather than the general concern typical of the 
moral domain. It entails the desire to improve the subject’s personal 
lot, not concern for other people. This is in accordance with the 
discriminatory and partial nature of emotions in general (to be dis- 
cussed below). Furthermore, sometimes the envious person wishes 
to deprive the object of her greater benefits, even if this means 
depriving oneself of some benefits as well. There can hardly be 
moral justification for such a desire; it would make sense only if the 
comparative inferiority of the subject were of crucial importance.” 

If moral concern is not essential to envy, its classification as a 
particular example of resentment is inadequate. Resentment may be 
characteriz¢d as an emotional protest against what is perceived as 
morally unjust. Accordingly, it is closer to anger than to envy. Al- 
though resentment is sometimes directed at the superiority of 
others, the focus of the subject’s concern is some kind of moral 
attitude rather than superiority itself. Here we may distinguish be- 
tween discontent, envy, and resentment. Discontent arises when we” 
believe that something (of some importance to us) is wrong, namely, 
a better alternative is available. Envy occurs when the wrongness is 
related to our inferior situation; resentment occurs when wrongdo- 
ing is perceived: it conveys an implicit accusation. When unjust 
treatment, but not inferiority, is perceived, anger and resentment 
are more dominant than envy. The relation between envy and re- 
sentment is somewhat similar to that between regret and remorse, 
except that the latter are directed at oneself. Like resentment, re- 
morse is usually directed at moral wrongdoing; as in envy, the main 
concern in regret is undesirability or dissatisfaction and not neces- 
sarily immorality. Furthermore, resentment and remorse presup- 
pose some responsibility of an agent (object in resentment and sub- 
ject in remorse) for the immoral behavior, such responsibility may 
be absent from envy and regret.* Envious people often like to em- 


? The readiness to suffer in order to make the other person suffer more 1s 
illustrated in the story about the angel who came to earth and promised a farmer 
to fulfill any wish, though on one condition. his neighbor would receive twice as 
much of the same. In that case, said the farmer, please remove one of my eyes. On 
the relationship between moral and emotional attitudes, see J. Oakley, Morality 
E A Routledge, 1991); my ‘Emotional and Moral Evalua- 


Soe R Folgen, Pe XX (1992): 214-29 
See R. F "Perceived Injustice, Referent iuons, and the Concept of 
ep 
Bist sat 1,” Representatsve Research tn Psychology, XIV (1984). 88— 


108; W. G. Parrott, “The Emotional Experience of Envy and Jealousy,” in P. 
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phasize their concern for moral justice, thus attempting to justify it. 
Accordingty, they tend to describe their attitude as resentment 
rather than envy. It is clear, however, that this is often a kind of 
rationalization of their negative attitude to being inferior. Hence, 
the transformation of envy into resentment is common, whereas the 
opposite transformation is rare. 

If desert is not the central concern of envy, the natural candidate 
for such concern is inferiority. The importance of the inferiority 
concern in envy conveys the weight we attach to our comparative 
stand. People compare themselves with others in order to reduce 
uncertainty about themselves and maintain or enhance self-esteem. 
An unfavorable comparison often leads to envy. Social comparison 
appears in many forms; an obvious form typical of envy is competi- 
tiveness. Our inferiority displeases us since it is perceives as reflect- 
ing some loss in a certain ongoing competition. The ‘differences 
among different people are often perceived as reflecting different 
positions in an underlying competition. The connection between 
competitiveness and envy is aptly described by A. Kohn:* 


Competition creates a prized status where none existed before, thereby 
giving us something to desire. Then it insures that not everyone can get 
it. Finally, competition requires that those who obtain the reward can 
do so only by defeating everyone else. Both the objective and subjective 
conditions for envy are established, in other words: restricted acceas to 
something desired and a (quite accurate) belief that someone else has 
got it at one’s own expense (ibid., p. 141). 


Since we usually do not compete with dead people, we usually do 
not envy them and hence can afford to be generous. Following 
Hobbes, we may even say that praise of ancient authors frequently 
proceeds from envy of the living with whom we compete rather than 
from reverence of the dead. When we consider ourselves as compet- 
ing with the dead, as when we want to outstrip their achievements, 
we may envy or be bitter toward them. Aristotle” argues that the lack 
of competition is also the reason why we do not envy those not yet 
born. It should be noted that envy presupposes a certain social com- 





Salovey, ed., The Psychology Jean ae (New York: TTEN a pp. 
8-9, G Taylor, Pride, Shame and Guili (New York: Oxford, 1985), 
‘ No Contest: The Cass Against Competition (Boston: Houghton » 1986) 
5 Rhetoric, II, 10 It seems, however, that we envy future more than past genera- 
er One reason is the constant improvement in the quality of life. (Since such 
improvement was less typical in ancient times—it was even assumed that the 
z age was over—this factor has lesser weight in Aristotle’s view.) Another 
reason is rooted in the fact that it is casier for us to imagine a past world without 
us, than a future world. 
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parison, but not necessarily competition (though it typically does). 
Competition is one form of social comparison. My wish to be as 
wealthy as Rockefeller involves social comparison, but not competi- 
tion. I cannot conceive of myself as competing with Rockefeller in 
this regard. 

It is our relative deprivation, rather than all types of deprivation, 
which bothers us in envy. A rich person can be envious of another 
having a bit more; a beautiful movie star can be envious of another 
star’s beauty. ‘‘Enviers want to be better more than they want to be 
better off’; hence, “kings may seem to have almost everything good 
a person could have, but they may envy if there is anything good 
they yet lack and if they think that other kings have some advantage 
over them.”’® The notion of relative deprivation implies a distinction 
between the objective and subjective well-being of the subject, and 
the claim.that the two are not isomorphically related; sometimes the 
better off from an “objective” viewpoint feels subjectively worse off. 
Thus, a survey reveals that first-line supervisors earning between 
$12,000 and $15,000 a year were more satisfied with their salaries 
than company presidents earning about $40,000 a year. It is often 
true that “those who are the most deprived in an objective sense are 
not the ones most likely to experience deprivation.””” The fact that 
envy involves relative deprivation emphasizes the importance of the 
inferiority concern in envy. 

It may be argued that envy emerges not only when social compari- 
son indicates the subject’s inferiority, but whenever we desire some- 
thing that we lack; for example, when we are tired or wish to have 
more money. I believe, however, that such cases represent discon- 
tent or covetousness rather than envy. In discontent and covetous- 
ness, social comparison is not an essential component as is the case 
in envy. Unlike covetousness and discontent, which are merely con- 
cerned with gaining something or achieving a certain state, envy is 
mainly concerned with someone else who has something or is in a 
certain state. I covet money and am discontented with my financial 


° Montald, “Envy” (unpublished manuscript). It may be argued that envy does 
not represent the wish to eliminate inequality but to achieve superionty. Al ough 
I agree that the latter wish may be found in some cases of envy, it is the former 
which is more essential to envy. The latter wish 1s often found when envy 1s not 
present and many cases of envy lack it. 

7F. Crosby, “A Model of Egoisucal Relative Deprivation,” Psychological Re- 
view, LXXXII (1976): 85-113, p. 85 See also Kohn, J. Neu, "Jealous Thoughts,” 
in A O Rorty, ed., Expla:rning Emotions (Berkeley California UP, 1980). The 
relative deprivation typical of envy may, however, be connected with what the 
envious person perceives as nonrelative deprivation of respect ot attention that 
the subject would like to gain. Indeed, the quest for respect or fame is a crucial 
element of envy g 
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situation, but I envy the rich. Unlike envy, covetousness does not 
involve a belief in the subject’s inferiority vis-a-vis the person who 
has the desired object. In fact, people of superior standing often 
covet what belongs to people in an inferior position. Covetousness is 
more similar to greed than to envy. Like greed, it does not cease 
when the desire is achieved, and like kleptomania it is directed at 
something one does not really need. 

To conclude the first stage of our discussion: it is quite clear that 
the central element of envy is not a moral concern but perceived 
inferiority. If inferiority were the only concern in envy, we should 
expect a positive correlation between the intensity of envy and the 
subject-object inequality. In the next section I shall show, however, 
that this is not the case and that the relation between envy and 
inferiority is complex. 

0 l 
Unlike the proposed correlation between envy and inferiority, envy 
seems to be directed at those who are like us or equal to us but are 
still slightly superior to us. The Greek poet Hesiod already wrote 
that “potter is furious with potter and craftsman with craftsman, 
and beggar is envious of beggar and singer of singer.” Aristotle 
argues that “we envy those who are near us in time, place, age, or 
reputation.” Similarly, Bacon observed that ‘‘where there is no com- 
parison, no envy; and therefore kings are not envied but by kings.’ 
In envy, our attention is focused on those perceived to be immedi- 
ately above us, since (as G. Simmel remarks) these people occupy the 
first rungs on fortune’s ladder that we will have to climb. These are 
the people we are most likely to be compared with or whose accom- 
plishments are most likely to demean us. This has been described as 
“neighborhood envy”: each person in a hierarchy primarily envies 
the person immediately above himself. In cases of extreme inequal- 
ity, especially cases of unattainability, far less envy is aroused than in 


® See P. Walcot, Envy and the Greeks (Warminster. Ans & Phillips, 1978), p. 9 
Aristotle, Rhstonc, 0, 10; Bacon, Essays, in Selected Wnisngs (New York: Modern 
Library, 1625/1955), p 9 Kant’s postion on this issue is interesting Although he 
argues that ‘‘Persons of the same station and occupation in life are particularly 
prone to be jealous of each other,” he contends that peo people compare themselves 
with others in order to achieve a better self-image. A favorable companson is 
likely to emerge since people ‘‘choose as a rule the worst and not the best of the 
class with which they set up comparison; in this way their own excellence shines 
out.” See Kant, “Jealousy and its Offspring—Envy and Grudge,” in Lectures on 
Ethics (New York. Harper, 1963). Kant’s position is conmstent if we assume that 
we choose similar people as constituting our comparative framework, but within 
this framework the particular individual chosen for comparison is that occupying 
a somewhat mferior position Although it ıs true that m order to cheer up we 
sometimes compare ourselves to those inferior to us, such an escape device is 
often not helpful; otherwise, envy would not be so common. 
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cases of minimal inequality, which inevitably provokes the envious 
one to think, “I could be in his place.” Although in typical envy we 
imagine ourselves in the object’s place, we do not lose our identity; 
we keep it while imagining an improvement in our situation. It is 
easier to keep our identity while imagining ourselves in the place of 
those similar to us.? A woman may envy another woman for her 
beauty, but she will probably not envy a handsome man. Where no 
proximity exists, comparison is less likely to arise and we are less 
prone to feel inferior. Those who are close to us, but still above us, 
emphasize our own inferiority more than those who are distant from 
us. Moreover, the issue of desert is more central in closer proximity: 
the subject-object comparison is easier, the subject's (slight) inferior- 
ity is not taken for granted and its presence is more likely to be 
perceived as unfair. 

Research concerning levels of aspiration confirms the promi- 
nence of the proximity variable. It has been shown that in most 
situations the level of aspiration is placed slightly above the subject’s 
performance. This is in accordance with the tendency of the level of 
aspiration to stay out of too-difficult or too easy areas. It is also in 
accordance with our wish to be slightly better than those with whom 
we compare ourselves. An interesting study was made of people with 
different salaries. They were asked how much of a raise they re- 
quired in order to live comfortably. Most subjects in all salary ranges 
indicated that an increase of about 15 percent would suffice. To put 
it crudely, this is usually the range of our envy. Empirical evidence 
suggests that competitive comparison is maximal in small subject- 
object gaps in current positions, that is, when the subject is slightly 
inferior to the object.’° The factor of the availability of an alterna- 
tive, which is a major factor in the generation of envy and other 

emotions, is most significant in such cases (see below). 

The subject-object proximity typical of envy and other emotions is 
compatible with another salient characteristic of emotions: their 
partial and discriminatory nature. Not everyone and not everything 


9 CFE. J. Elster, “Envy in Social Life,” manuscript (1991), H Schoeck, Envy 
(New York: Harcourt, 1970), pp. 62, 237; M. Silver and J. Sabim, “The Percep- 
uon of Envy,” Socal gy, XLI (1978): 105-17, D H Gensthtcs: Moral 
Issues tn the Creation of People (Berkeley: California UP, 1991), p. 187 

10 See L. Festinger, “A Theory of Social Comparison Processes," Human Rela- 
tons, VO (1954) 117-40; K. Lewin, T. Dembo, L Festinger, and P. S. Sears, 
“Level of Aspiration,” in J M Hunt, ed, Personality and the Behavior Disorders, 
Vol. 1 (New York: Ronald, 1944), pp. 333—78; S Dakin and A. J. Arrowood, ‘The 
Social Comparison of Ability,” Human Relations, xxxv (1981) 89-109. The 
research concerning the different salaries was conducted over forty years ago I 
assume that by now the of our envy 1s wider because envy is stronger m 
contemporary society (see below). 
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is of emotional significance to us. We usually would not assume an 
emotional attitude toward someone completely unrelated to us. 
Thus, when we hear of the death of thousands of people in an 
earthquake occurring in a remote (that is, from our vantage point) 
part of the world, our emotional response comes nowhere near the 
intensity of our grief at the death of someone close to us, nor does it 
approach the level of feeling we experience in watching the death 
throes of a single victim of that same earthquake on television 
(thereby establishing some proximity with this particular victim). 
Emotions require many resources such as time and attention. Since 
these resources are limited, emotions must be partial and discrimina- 
tive.!! Subject-object proximity frequently employs a partial and 
momentary perspective that contradicts the impartial and broad 
perspective typical of scientific knowledge. When we look at some- 
one from a close distance, our vision of this person is often dis- 
torted. In the extreme case when there is no distance at all, namely, 
when one places the object upon the eye itself, one does not see the 
object. We need some distance in order to achieve a perspective that 
encompasses multiple aspects of the object and thereby makes the 
object less fragmentary. But keeping a distance is contrary to the 
involved and interested perspective typical of emotions. Accord- 
ingly, the broader the perspective, the lesser the intensity of emo- 
tions. Thus, a perspective that does not merely focus on the object’s 
superior position but also on that person’s character may reduce 
envy. If the object is kind and modest, envy may decrease since it 
can be assumed that the object respects the subject.’? Envy may also 
decrease if the object is mean since the negative evaluation of the 
object’s character will outbalance the object’s superiority (accord- 
ingly, it has been claimed that “only the shameless envy the 
wicked’’). What matters in reducing envy in these cases is not so 
much how we evaluate the object, but that we do so from a broader 
perspective referring to many aspects of the object. The broader the 


11 Spinoza argues that “the affects are excessive, and occupy the mind in the 
consideration of only one object so much that it cannot think of others ” For 
example, “a greedy man thinks of nothing else but profit, or money, and an 
ambitious man of esteem” (Ethics, Iv, Prop 44, Schol). 

12 Isocrates, the ancient Athenian orator, says that we envy those who surpass 
us in intelligence or in any other respect, unless we are won over by their daily 
kindness. See Walcot, p. 35. The absence of envy in this case may be due to our 
interpretation that these people deserve their good fortune (indicating that desert 
has some role in envy) or that their kmdness implies respect for others and hence 
they lack a feeling of superiority (Since envy is a complex state, it may be also 
directed, because of other reasons, at virtuous, superior people. If we could 
isolate the effect of being virtuous, however, the intensity of envy would be re- 
duced in most cases ) 
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perspective, the lesser the weight of the particular inferiority that 
generates envy. 

In light of the partial nature of envy, the relevant inferiority con- 
cern is also partial. It addresses people who belong to our emotional 
environment, that is, those of emotional significance for us. The 
inferiority concern in envy is partial (or specific) in two important 
senses: (a) the subject’s position is low compared to a spectfic person: 
it need not be low in comparison with the whole population, the 
average position in society, or even the top five percent; (b) the 
subject’s position is low in a specific domain particularly relevant to 
the subject’s self-evaluation. Social comparison is not exercised in- 
discriminately, but refers to people and domains relevant to our 
self-evaluation. We compare ourselves with people whom we con- 
sider to occupy an approximately similar position or possess similar 
ability. We tend to exclude from our reference group people who 
appear definitely superior or inferior to us as well as those belong- 
ing to irrelevant domains. 

The specific nature:of the inferiority concern in envy is related to 
the subject’s concern with relative and not general deprivation. 
Envy implies a particular situation worse than that of someone else 
of importance to us. We may envy only one aspect of another per- 
son, continuing to consider ourselves superior in general. We may 
envy someone for being rich, but if we know this person to be dis- 
honest, we consider ourselves superior. Generally, a subject, clearly 
superior to an object, may be envious of the same object who sur- 
passed her in a specific sphere. In such cases, envy would not usually 
be intense. Similarly, we may desire to possess something, but if this 
thing does not belong to our self-defining domains, envy will not 
arise or will not be intense. The lack of these things does not make 
us feel inferior since we “either have enough other good things to 
which our self-esteem is linked, or we have found deeper satisfaction 
elsewhere.”!? Whether we admire or envy another person is deter- 
mined, among other things, by the relevance of that person’s good 
fortune to our self-evaluation. When the good fortune is relevant, 
our self-evaluation and our evaluation by others is threatened and 
envy arises. This is particularly true of domains in which large dis- 
crepancies exist between actual and ideal self-description. Here the 
threat to the subject’s self-esteem is especially strong. This kind of 
specificity distinguishes typical adult envy from envy typical of chil- 
dren. General envy, nonspecific to certain people or domains, is 


13 Montaldi; my "Envy and Jealousy,” Canadian Journal of Philosophy, xx 
(1990): 487-516 
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typical only of young children for whom the value of most things 
comes simply from seeing others with them. Young children, who 
lack a well-defined sense of what is relevant or irrelevant to them- 
selves, feel discomfort every time they fail to measure up to an- 
other’s performance. Older children are more selective in this re- 
spect: failure in domains of importance elicits more envy than in 
marginal domains. The specific nature of the inferiority concern in 
adult envy explains why, contrary to common belief, psychological 
research has failed to reveal a significant positive correlation be- 
tween envy and an individual’s general low self-esteem. 

The foregoing discussion has indicated that envy is concerned 
with specific inferiority, not inferiority in general. This conclusion is 
incompatible with the assumption that inferiority is the sole concern 
in envy and that we should therefore expect a positive correlation 
between the intensity of envy and the subject-object gap in‘ current 
positions. Since social comparison is chiefly limited to those similar 
to us, envy should be more typical of small subject-object gaps. 
Moreover, in small gaps the question of desert is quite prominent. 
In comparison to people similar to us, we would presumably con- 
sider our inferior situation incompatible with what we deserve. In- 
deed, emotional intensity is greater in unjustified than in justified 
circumstances. This is because the situation is considered to be more 
abnormal and the availability of an alternative is greater. Further- 
more, if a strong wish to obtain something that another person has 
were to persist for a long time, the subject would tend to feel in 
some degree entitled to the desired object. Desert has after all some 
role in the generation of envy. Let us, therefore, examine the issue 


of desert. 
Il 


Both the approach considering inferiority to be the sole concern in 
envy and the approach characterizing envy as a moral emotion as- 
sume that desert claims are to be identified with moral claims. Ac- 
cordingly, the second approach, which realizes that desert has some 
role in the generation of envy, identifies envy with resentment and 
considers both to be moral emotions. The first approach, realizing 
that in many cases the moral concern is absent from envy, concludes 
that desert has no role in the generation of envy. The identification 


14 S, A. Bers and J. Rodin, ‘ ie emigre ealousy. A Developmental and 
Motivational Study,” Journal! of Personality and Psychology, XLVI (1984): 
766-79; P. Salovey and J. Rodin, “Envy and Jealousy in Close Relationship,” in C. 
Hendrick, ed., Clase Relationships (Newbury Park: Sage, 1989); A. Tesser and J. 
Campbell, ‘‘Self-Definiuon: The Impact of the Relative Performance and Simular- 


ity of Others,” Soaal Psychology Quarterly, xum (1980) 341-7. 
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of desert claims with moral claims is explicitly stated by a few philo- 
sophers. Thus, Aristotle argues that ‘‘whatever is undeserved is un- 
just,” and J. Hospers claims that “‘justice is (simply) getting what one 
deserves.’’'® I believe that the most plausible way to explain envy 
and resolve the moral paradox of envy is to reject this assumption 
and distinguish between moral and desert claims. 

The distinction between moral and desert claims is not easy to 
make and its detailed description is beyond the scope of this discus- 
sion. Nevertheless, I would like here to point out some important 
characteristics of the distinction which are relevant to the role of 
desert claims in envy. Whereas desert claims are based on the value 
of a person’s attributes and actions, moral claims often refer to 
obligations toward other persons. Desert claims are not necessarily 
grounded in anyone’s obligations, but rather, as G. Sher’ rightly 
claims, “in the value of persons’ coming to have what they deserve.” 
This is so, since if ‘‘persons themselves matter, then what matters to 
persons should matter as well.” Because of the more private nature 
of desert claims, their validity is often grounded in preinstitutional 
values; desert, Sher argues, “‘preserves its normative independence 
from institutional arrangements.” A major reason for the private 
nature of desert claims is that they are often based on one’s personal 
desires.'’ In perceiving our situation to be undeserved, we do not 
necessarily accuse a certain person of behaving criminally or immor- 
ally; we assume, however, that our being in such a situation is in 
some sense unfair. Whereas in moral claims the agent is a person 
having some responsibility, an impersonal cause can also be an agent 
in desert claims. Heavy rain may be the cause of undeserved but not 
of immoral circumstances. Similarly, whereas moral claims are typi- 
cally directed at a human being (and sometimes also other crea- 
tures), desert claims may also be directed at inanimate object. One 
can say (as Sher notes) that “Cleveland deserves better publicity, 


 Anstotle, Rhetonc, 0, 9. Hospers, Human Conduct: Problems of Ethics (New 
York: Pinak Brace Jovanovich, 1972), p. 361. 

Desert (Princeton: University Press, 1987), pp. 195, 58, 205 My discussion of 

desert is heavily indebted to Sher’s excellent analysis. Whereas r considers 

“desert” a moral notion that is distinct from “nght” and “justice,” I use “desert” 

in a broader sense and allow for desert claims that do not belong to the moral 


domain 

17 This is in accordance with the clam that desires confer value on their objects. 
Spinoza, e.g., argues that “we jJ something to be good because we strive for 
it, will it, want it, and desire it” , ID, 9, Schol.) For further discussion of 


this matter, see Sher, p. 55. See also D. A. Prentice and F. Crosby, “The Impor- 
tance of Context for Assessing Deservingness,” m J.C Masters and W. P. Smith, 
eds., Social Social Justice, and Relatrve Deprivation (Hillsdale: Erl- 
baum, 1987), pp. 165-82. 
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since it is an interesting city.” Desert claims are based on perceived 
undeserved situations that are not necessarily undeserved in a more 
objective sense. Perceived undeserved situations may be due to im- 
personal, arbitrary circumstances. Being unlucky may not involve 
any criminal or immoral deeds and attitudes of an agent, but the 
unlucky person may still rightly regard it as undeserved. It is not 
immoral for a rich person to win a big prize in the lottery or to 
marry a rich person; nevertheless, many poor people may consider it 
to be undeserved and envy this person. 

The desert claims typical of envy are personal and only rarely can 
be satisfied by a moral action. Such claims are not considered as 
serious moral claims. Concerning many of them, telling the subject 
to be satisfied with his own fortune can be considered to be a proper 
moral response. Moreover, in most cases there is no one to whom to 
address desert claims, and anyhow fulfilling these claims ‘cannot be 
done in light of practical considerations. When David envies Adam’s 
beauty, his envy involves a desert claim but not a serious moral 
claim. Telling David that he should be satisfied with his own good 
fortune in other domains is in this case a proper moral response. 
There is no one to blame for Adam’s beauty and it is impractical to 
change the situation by doing a plastic surgery on David and all 
other people who are not as beautiful as Adam. 

A distinction can be made between deserving and wishing. When 
we express our wish to have something, we do not necessarily imply 
that our lack of this thing involves unfairness or injustice. In claim- 
ing that we deserve something, we do suggest there is something 
unfair in the fact that we do not have this thing. Envy involves not 
merely a wish claim but a desert claim as well. Envy is often based on 
personal, nonmoral norms of desert, whereas resentment is usually 
based on societal moral norms of justice.!® Although envy typically 


18 The urge to find some kind of unfairness in the subject’s inferior postion 
may also be explained by referring to the saying that ‘Injustice is relatively easy to 
bear; it is justice that hurts.” A similar view to the one presented here us found in 
Richard Smith, “Envy and the Sense of Injustice,” in P. Salovey, ed , The Psychol- 

of Jealousy and Envy (New York Guilford, 1991). Smith speaks about “the 
subjective, unsanctioned nature of the sense of injustice in envy ” He rightly 
argues that “In an unconventional but arguable sense, people on the short end of 
the distribution have been unfairly treated by fate.” Accordingly, “the envied 
n's advantage is, to some degree and on some subjective level, unfair” (pp. 

B5, 92-3) This valuable article clearly shows the need to distinguish the sense of 
injustice typical of envy from that typical of resentment. Smith refers to the 
unfaimess typical of envy in order to explain the element of hostility in 
envy. I think that this sense of unfairness principally explains the element of 
discontent in envy. Smce this sense is often not directed at culpable deeds or 
attitudes of a specific agent, but at impersonal arcumstances, its mayor explana- 
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involves desert claims, it may sometimes involve moral claims. Envi- 
ous people often describe the unfairness involved in their situation 
as moral; they thus attempt to legitimize their envy and may even 
present it as resentment rather than envy. Since the borderline be- 
tween the moral and desert claims is not clear-cut, it is not always 
easy to distinguish between envy and resentment. 

It should be clear by now that the issue of desert plays some role 
in the generation of envy. This is not, however, the role suggested by 
the aforementioned two approaches. The first approach is right in 
arguing that inferiority is of concern in envy, and the second is right 
in assuming that desert is of concern in envy. The first is wrong in 
believing that this concern refers to what the subject considers to be 
deserved inferiority, and the second is wrong in claiming that the 
concern of desert is directed at the object. Accordingly, the focus of 
concern in envy is not merely the subject’s (deserved) inferiority, nor 
the object’s undeserved good fortune. Envy is rather concerned with 
the subject’s undeserved inferiority. The issue of desert is always ac- 
companied by the issue of the subject’s particular inferiority. The 
subject’s concern is not a general moral concern for justice, but a 
particular personal concern for what the subject considers to be 
undeserved inferiority. Sometimes envy includes the suspicion that 
the subject’s inferior position indeed reflects what the subject actu- 
ally is. Even in this case the subject may still feel the situation to be 
unfair since being inferior is unfair. Here the complaint is not 
against the wrongdoing of people but against a divine or imper- 
sonal cause.'? 

The element of desert is not obvious in some cases of what is 
commonly characterized as envy, for example, when I am hungry 
and see someone eating. The issue of desert is clearly present if I do 
not have the money to buy food. If I do have the money, however, 


tory role does not concern personal hostility toward a certain agent but a more 
general feeling of discontent. 

19 In many respects pleasure-m-others’-misfortune (Schadenfreude) is similar to 
envy. Like envy, typical cases of pleasure-n-others’-misfortune involve small dif- 
ferences and desert claims. Thus, this emotion may be characterized as delight in 
someone’s deserved and usually minor misfortune. When the object’s situation 
becomes highly undemrable, pleasure-in-others’-misfortune often turns into pity 
Pity is concerned with substantal suffering and not with minor misfortunes. Plea- 
sure-in-others’-misfortune conveys the belief that some justice has been done. 
The more just the misfortune is, the better justified 1s the pleasure in it. Indeed, 
we often hear people who delight in others’ misfortune saying that “God has 
punished this person,” or that ‘there is justice in the world.” See my papers, 
“Pleasure-In-Others’-Misfortune,” Iyyun, XLI (1992): 41—61; and “Another Look 
at Pleasure-In-Others’-Misfortune,” Iyyun, xL (1998). 
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but do not enter the restaurant because I am in a hurry, the issue of 
desert is hardly relevant. But such a case should not bé regarded as 
envy—anyway not a typical case of envy. It concerns a momentary 
deprivation and not relative inferiority. Moreover, if I am in a hurry 
to do something that I like, I value this thing more; accordingly, the 
momentary hunger will not be of significance and it may be over- 
come by eating a sandwich. If I am hungry because I am obliged to 
attend a certain tedious meeting, I may think that there is something 
unfair in my situation; in this case my attitude will be closer to envy. 
Deprivation in satisfying our needs is not identical with envy. Envy 
involves the subject’s relative inferiority and the belief that this de- 
privation is undeserved. Consider another example. Moses con- 
siders himself lazy but nevertheless envies Dean’s success, which is 
the result of Dean’s hard work. If Moses believes that his laziness is 
due to an innate disposition that he cannot overcome, he may con- 
sider his situation unfair, and envy is likely to emerge. If Moses can 
overcome his laziness, however, but chooses not to do so as he 
considers Dean’s success to be not worth the price of not being lazy, 
then envy is not likely to appear, or it will at least not be intense. 
Moses may want the results of Dean’s success, but since typical envy 
is directed at a person and not at a certain thing that the person has, 
this would not be typical envy in which the subject is ready to ex- 
change his present position with that of the object. 

The relation between desert and the intensity of envy is positively 
correlated: the more we believe our inferior situation to be unde- 
served, the greater the intensity of envy. Indeed, there is some em- 
pirical support to the claim that the tendency to feel inferior to 
others and to construe one’s disadvantages as due to unfairness 
predisposes a person to be envious.” When we believe that we de- 
serve an inferior position, for example, when the object is perceived 
to be much better than us, the competitive concern is hardly present 
and envy usually does not arise. 

We are now in a position to resolve the moral paradox of envy 
arising from the fact that it is morally condemned despite its appar- 
ent moral value embodied in the desire to eliminate inequality. The 
paradox is resolved by referring to the combination of the two ma- 
jor concerns in envy: desert and inferiority. The perceived undeserv- 
ingness in envy does not express moral injustice; hence, envy is not a 
moral emotion. Moreover, although we consider our situation to be 
unfair, to express it in public means to admit our inferiority and this 
is difficult to do. Nevertheless, envy involves value claims concern- 


See Parrott. 
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ing the situations in which we think we deserve to be. Hence, it has 
some normative justification. 

Returning to our original question concerning the correlation 
between the intensity of envy and the subject-object inequality, the 
answer would be simpler if envy could be defined in terms of mere 
inferiority or desert. If envy were related to inferiority in general, it 
would be plausible to assume that the greater the inequality, the 
more intense envy would be. If envy were related merely to desert, it 
would be plausible to postulate a mainly negative correlation be- 
tween the intensity of envy and the subject-object inequality, be- 
cause questions of desert are more prominent in smaller inequality, 
where inferiority is less evident. Since we define envy as concerned 
with undeserved inferiority, it is not immediately obvious what the 
relationship is between the intensity of envy and the subject-object 
inequality. Take, for example, the case in which our colleague wins a 
professional prize (which presumably reflects the recipient’s merits). 
If the colleague indeed deserves the prize, then our inferiority is 
evident and envy is likely to arise. If the colleague does not deserve 
the prize, we may feel that we should have received it, once again 
provoking envy. The first case is more typical of envy, the second of 
resentment. In both cases inferiority as well as desert play a role in 
generating the negative emotion. In the next section, I shall try to 


clarify the complex relationship between the intensity of envy and 


the subject-object inequality. 
IV 

The notion of “emotional intensity” is quite complex and cannot be 
discussed at length here. I would merely like to point out a few 
major considerations that are relevant for our discussion. Emotions 
occur when we confront significant changes highly relevant to our 
own life. When we experience an emotion, our physiological and 
mental systems cease to maintain their usually stable state. Emotions 
indicate a transition in which a preceding context has changed, but 
no new context has yet stabilized. The unstable and transient nature 
of emotions is due to significant changes, exceedingly relevant to the 
subject’s situation (and the situation of those related to the subject). 
The stronger, or the more real the change, the more intense the 
emotion. Accordingly, the basic parameter determining emotional 
intensity is the one that measures the impact of the change upon the 
subject’s situation. As this parameter measures the reality of the 
change in the subject’s eyes, it may be termed a “sense of al- 
teration.’””*! 


21! Spinoza strongly emphasizes the importance of changes in the subject's situa- 
ton for the generation and itensity of what he terms “passive emotions.” He 
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The “sense of alteration” involves two kinds of change in the 
subject’s situation: (a) perceived actual changes, and (b) imaginary 
possible changes. The first kind refers to the perception of factual 
changes that have taken place, while the second refers to imagined 
counterfactual changes. The first kind of change refers to actual 
modifications in the subject’s situation. The greater this change, the 
more intense the emotion. The more the subject believes the 
emotion-inducing situation to be real, the more intense is the emo- 
tion. When we know that the danger is real, we are more frightened 
than when we think that the danger could be illusory. Likewise, we 
shall be less envious in seeing a successful person in a movie than in 
a real situation. The second kind of change typical of emotions is 
imaginary. It encompasses a mental construction of a counterfactual 
alternative, to be compared with actual reality. Because of the signifi- 
cance of the change for the subject, the subject is keenly aware of an 
alternative to the current situation. The more available the alterna- 
tive is, namely, the closer the imagined alternative is to reality, the 
more intense the emotion is. Availability (or feasibility) is deter- 
mined by how remote from actual reality the relevant event is consid- 
ered to be. Accordingly, Spinoza argues that the better the wise man 
grasps the fully deterministic nature of events around him, the less 
likely he is to experience passions. That is, the better a person real- 
izes the lack of alternatives, the less likely it is for this person to be 
affected by emotions. Generally, ‘insofar as the mind understands 
all things as necessary, it has a greater power over the affects, or is 
less acted on by them.” Thus, Spinoza” argues that no one pities an 
infant because it cannot speak, walk, or reason; but if most people 
were born full-grown and only one here and there as an infant 
everyone would pity infants. 

Although the first kind of change constituting the sense ór alter: 
ation may be regarded as more objective than the second, it is also 


claims that “we live in continuous change, and .. . as we change for the better or 
worse, we are called happy or unhappy” (Ethics, v, Prop. 39, Schol ). Among 
contemporary philosophers, W. Lyons most strongly emphasizes the role of 
change m generating emotions. He argues that a subject is in an emotional state if 
and only if he ıs in an abnormal physiological state caused by his evaluation of the 
context in relation to himself; see Emotion (New York: Cambridge, 1980), ch 7 
Similarly, the psychologist R. Lazarus argues that an emotional state “is a tran- 
sient reaction to c encounters with the environment, one that comes and 
depending on particular conditions”; Emotzon and Adaptation (New York: 
Oxford, 1991), p. 47. He distinguishes an emotional state from an emotional trait 
that “1s not an emotion but a disposition to react with one” (p. 47). For further 
discussions dealing with some of the difficulties concerning the role of change ın 
generating gain cf. my Emotions toward Others (forthcoming). 
7 Ethics, V, Prop 6 
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essentially a subjective change. It is the subject who perceives the 
change and accordingly considers it more or less significant. More- 
over, changes associated with emotions may not merely refer to the 
subject but also to those constituting the subject’s environment. 
Again, it is the subject who determines which people belong to this 
environment. The subject not only determines the significance of 
the change but also its scope. In a way the two kinds of change are 
similar in measuring the degree of reality of the given change i in the 
subject’s situation. We construct a psychological reality in which 
some changes are smaller than others, since they have less influence 
on the subject’s situation, and some alternatives are closer to reality 
than others, since they are more likely to happen. In constructing 
our reality, we assume that some modifications are much more natu- 
ral and feasible than others, and some attributes are particularly 
resistant to change.” 

The “sense of alteration” is more significant in small subject- 
object gaps where every change can significantly alter the relative 
position of the subject. For example, changes from a superior to an 
inferior position and vice versa are very probable in such gaps. 
When the subject-object gap is large, most changes in the subject’s 
position will be insignificant for the subject’s relative inferiority and 
hence will hardly have emotional significance. This is true concern- 


ing both actual changes and imaginary changes. To illustrate this 
point, consider the following example by Kahneman and A. 


Tversky:™* 


Mr. C. and Mr. D. were scheduled to leave the airport on different 
flights, at the same time. They traveled from town in the same limou- 
site, were caught in a traffic jam, and arrived at the airport 30 minutes 
after the scheduled departure time of their flights. Mr. C. is told that 
his flight left on time. Mr. D. is told that his flight was delayed, and just 
left five minutes ago. Who is more disappointed? 


Mr. D. is more disappointed since catching up with his flight was 
almost possible for him. Mr. D.’s flight was only five minutes away 
from actual reality, whereas Mr. C.’s flight was half an hour away. It 
is easier for Mr. D. to construct a reality in which he catches up with 


B See D. Kahneman and D. T. Miller, “Norm Theory: Comparing Reality to its 
Alternatives,” Psychological Review, xcm (1986): 136-53, pp. 142-3 These au- 
thors claim that “A counterfactual possibility should appear ‘close’ if it can be 
reached by altering some mutable features of reality.” 

™ ‘The Smnulation Heuristic,” in Kahneman, P. Slovic, and pace | 
under Uncertatnty: Heuristics and Biases (New York: Cambridge, 1 fier 
mann and Miller, “Norm Theory.” 
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his flight than it is for Mr. C. Disappointment is stronger when one 
can imagine oneself as being close to the desired object. Similarly, 
we envy people when we can easily imagine ourselves in their place; 
hence, we envy more those who are like us or equal to us. In light of 
the importance of the availability of the alternative, ‘almost situa- 
tions” have additional emotional nuances. As F. Heider?” argues, if 
we hold number 5304 in a lottery and we learn that 5305 is the 
winning number, this near success is probably harder to bear than if 
there were no winning number anywhere near our town. Generally, 
a near success is more frustrating than a simple failure, and misfor- 
tune appears less negative when it is thought of as necessary than f 
when it is seen as something that could easily have been prevented. 

The subject’s inferiority is more obvious but less significant in 
large gaps. In great subject-object gaps, when subjects find it im- 
practicable to establish equality, they no longer find this inequality 
disturbing. Our emotional sensitivity to others’ privileges, and hence 
our own inferiority, increases as privileges become rarer and more 
controllable; attaining the alternative is easier in these circum- 
stances. Accordingly, slight inequality has often greater emotional 
impact than great inequality. 

Assuming a necessary linear correlation between the intensity of 
envy and the subject-object distance is then a false assumption which 
ignores the complex nature of envy. Envy may occur in slight and 
very great proximity—though it usually will not be intense in these 
circumstances. Envy may also not occur in great similarity of back- 
ground such as is found in very close relationships. A Hebrew prov- 
erb states that a person is never envious of a son or a pupil. (Some of 
us doubt if this is even remotely true concerning one’s pupil.) 
Achievements of those very close to us evoke pride rather than envy 
when these achievements are understood (by us or by society) to be 
connected to us in such a manner that we can share the credit they 
bestow. Moreover, very close relationships are complex and rich; 
hence, it is not likely that envy, which focuses upon a partial aspect, 
is common in these relationships. A father usually considers his 
son’s success as part of his own success and not as something sepa- 
rate that threatens his own self-esteem. When those close to us have 


2 The Psychology of Interpersonal Relations (New York. Wiley, 1958), pp. 141- 
4. Needless to say, the generation of envy and other emotions is also a function of 
one’s personality and prevailing social norms. Thus, someone may be leas envious 
by nature than other people, or a certain society may discourage envy. It is as- 
sumed here that these factors are given and the task is to find the variables 
rr greene y: aaao orf uaa lac a a e 
social) factors are changed as the result of the emotional experience, thus becom- 
ing effects rather than causal antecedents of an emotion (Lazarus, p. 204). 
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succeeded in something essential for our own self-evaluation, how- 
ever, or when our ties with these people are not evident, envy may 
replace pride. Thus, envy can play a part in some close relationships 
such as that between parents and children, siblings, or husband and 
wife. A father may wish to be at his son’s age, to have his son’s sexual 
or economic opportunities, etc. He may also perceive his own situa- 
tion to be inferior to that of his son. But a father’s attitude toward 
his son rarely becomes that of intense or malicious envy. For exam- 
ple, a father will usually not aspire to damage his son’s position in 
order to nullify the inequality between them. If such intense envy 
does occur, we can assume that the father-son relationship is not 
very close. It seems that it is more common to see children who envy 
their famous parents than parents who envy their famous children. 
This shows that parents are closer to their children than vice versa.”° 
Generally, social comparison underlying envy requires not only a 
certain proximity, which enables us to place the two items in the 
same comparative framework, but also a certain distance that en- 
ables us to see the two items as distinct. 

Sometimes envy is also-present in cases of large subject-object 
gaps. Thus, we may envy rich or famous people with whom we 
hardly compare and in any case do not think of competing. The 
existence of such cases does not contradict our contention that envy 
is typical of small subject-object gaps. Typical is not the same as 
universal. The tendency to compete with a noncomparable agent is 
reduced but does not disappear in small gaps because conditions of 
noncomparability are relative and continuous, rather than a case of 
all or nothing.”’ Furthermore, the subject-object gap is not the sole 
variable in determining the generation of envy; other variables may 
sometimes be effective even with a large subject-object gap. More- 
over, the inferiority in large gaps is more profound, thought to be 
inadmissible, and cannot be ignored. Another aspect complicating 
the situation is that envy, like other emotions, does not appear in 
isolation but as part of a complex emotional state. Other compo- 
nents may intensify envy since they make the whole state more pain- 
ful. The less significance people attach to their immediate environ- 
ment and the more significance they attach to their position in over- 


© See my ‘Why Did Psammenitus Not Pity His Son?” Analysis, L (1990): 118— 
26; and "Envy and Jealousy”; see also A. Teaser, “Self-Esteem Maintenance in 
ae Dynamics,” Journal of Personality and Soaal Psychology, XXXIX (1980) 
77-91. 

™ P, J. Hoffman, L. Festinger, and D. H. Lawrence, “Tendencies Toward 
Group Comparability in Competitive Bargaining,” Human Relations, vii (1954): 
141-59, p. 144, Dakin and Arrowood. 
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all society, the more likely is envy to appear in what is usually 
considered to be large subject-object gaps. For these people, more 
varieties of inferiority are emotionally significant. 

Although emotional proximity is obviously an important variable 
of emotional intensity, a precise characterization of this variable is 
difficult. I now attempt to clarify this issue somewhat. It may be said 
that greater emotional proximity implies greater emotional intensity 
(or sensitivity). Such a contention does not provide us with much 
information. If S is emotionally closer to X than to Y, then for S$ 
what happens to X is emotionally more significant than what hap- 
pens to Y. Hence, S is emotionally more sensitive toward X; in other 
words, S’s emotional attitude toward X is generally speaking more 
intense than toward Y. The discussion would be more informative if 
we were to discuss at least the following major issues: (a) what con- 
stitutes emotional proximity, and (b) the different kinds of correla- 
tion between emotional proximity and emotional intensity which are 
typical of different emotions. Discussion of the first issue may reveal 
the relationship between proximity in general and emotional prox- 
imity, thus clarifying the nature of emotional proximity. Then we 
shall be in a position to discuss the second issue, which has many 
practical and theoretical implications. 

We may distinguish three major components of emotional proxim- 
ity: (a) similarity in background, for example, education, age, place, 
time, and opportunities; (b) closeness in current positions, for exam- 
ple, status, salary, or possession of a certain object; and (c) relevancy 
for self-evaluation. All components determine the subject’s refer- 
ence group. The first determines the general comparative frame- 
work, whereas the borderlines of the specific reference group are 
also determined by the gap in current positions and relevancy. 
Those who are similar to us are the natural candidates for social 
comparison underlying envy. Since there are divergent properties in 
light of which the comparison can be made, however, the question 
who is included in the reference group is to a certain extent up to 
the subject. Thus, we often exclude from our reference group those 
who are significantly above us though they may be quite similar to us 
in other aspects. The second factor, namely, the gap in current 
positions, is in this sense more objective; it expresses, for example, 
objective differences in salary or rank. 

The inclusion in the reference group is then of great emotional 
significance. Accordingly, we should distinguish between within- 
group (of reference) inequality and between-groups inequality. 
Within-group inequality concerns gaps relating to people with 
whom the subject can compare and conceive of replacing; between- 
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groups inequality refers to gaps related to people who are irrelevant 
to the subject’s self-evaluation and with whom comparison is negligi- 
ble. The suggested negative correlation between the intensity of 
envy and the subject-object gap refers to people belonging to two 
different groups. Concerning within-group inequality, we may envy 
the person with significantly better fortune more strongly than the 
one with slightly better fortune. A private in the army would usually 
envy a corporal or a sergeant and not a field marshal, since a field 
marshal is not part of his reference group. Within the group of 
people closely related to him, however, this person may envy a ser- 
geant or even a lieutenant more than a corporal. Similarly, a worker 
in a big firm may envy the president of the firm less than those 
working with her; but within the group of people working with her, 
she may envy those whose yearly income is $1,500 larger than hers 
more than those earning $100 more than she does. Big gaps in 
current position or salary may prevent us from considering the ob- 
ject a candidate for our reference group despite great similarity in 
background. If despite the gap the object is nevertheless in the 
subject’s reference group, the similarity in background should be 
significant. Within the reference group, members are quite similar 
to each other and hence are perceived to deserve a similar fortune. 
In this case the extent of inferiority is the major variable determin- 
ing the intensity of envy. Outside the reference group there is not 
much similarity in background, the comparison is harder and the 
subject’s inferiority is not the major factor in determining the inten- 
sity of envy. 

The inclusion in a given category is of importance in other realms 
as well. Thus, equal-sized physical differences between stimuli are 
perceived as smaller or larger depending on whether the stimuli are 
or are not in the same category. Two different shades of green look 
more like each other than like any shade of yellow, even when there 
is no greater difference in wavelength between the yellow and one 
of the greens than between the two shades of green.” In perceiving 
colors, inclusion in the same reference group (or category) reduces 
the subjective significance (and hence a given distance within the 
category is less significant than the same distance between a member 
of the category and a nonmember); whereas in emotions like envy, 
such inclusion increases the subjective significance (and hence a 
given distance within the category is more significant than the same 


S Harnad, Categorical Perception (New York: Cambridge, 1987). See also my 
The Perceptual System: A Philosophical and Psychological Perspective (New York: 
Lang, forthcoming). 
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distance between a member and a nonmember). This is compatible 
with the claim that emotions are involved and interested states con- 
cerning those related to us. 

After clarifying the nature of emotional proximity, we are in the 
position to discuss the correlation between emotional proximity and 
emotional intensity. It is obvious that the correlation is different for 
different emotions. Thus, in love we should expect a positive, linear 
correlation between emotional proximity and the intensity of love. It 
seems that emotional proximity is the essence of love. There is, 
however, a nonlinear correlation between the intensity of love and 
similarity in background, which constitutes one component of emo- 
tional proximity. Empirical evidence thus suggests that love is nei- 
ther typical of little nor of considerable similarity.” In describing 
the intensity of envy we should take into account the three major 
components of proximity, namely, similarity in background, close- 
ness (or gap) of current positions, and relevancy. If we take the gap 
and relevancy as constant, we should expect a more or less positive 
correlation between similarity and the intensity of envy. A more 
complex correlation is that between the intensity of envy and the 
subject-object gap (given that relevancy and similarity are significant 
and constant). Without drawing the precise lines of this correlation, 
I may say that the curve depicting it is somewhat similar to a bell- 
shaped curve. Up to a certain width, envy is quite low. From this 
point the gap begins to be large enough to generate significant 
emotional impact. Immediately beyond this point there is a sharp 
increase in the intensity of envy. There is also a point of maximal 
envy after which a widening of the gap will reduce envy. The exis- 
tence of such a point is illustrated by the fact that we are less envious 
of an increase in the income of rich people than of a corresponding 
increase in the income of people who are richer than us but poorer 
than the rich man. In very great gaps an additional increase will not 
affect the intensity of the subject’s envy. In any case, for most peo- 
ple some level of envy still exists on any positive level of the gap.” 

It is not immediately clear whether envy is more painful in large 
or small gaps. Some of the factors responsible for the more painful 
nature of large gaps are: (a) it is harder to ignore or explain away 
large gaps; (b) the subject’s inferiority is more emphasized; hence, 


P D. Byrne and S K. Murnen, “‘Maintainmg Relationships,” In R. J 
Sternberg and M. L Barnes, eds., The Psychology of Love (New Haven: Yale, 
1988) 

x A more precise formulation of the curve is to be found in my Emotions 
toward Others. Sec also S Paul, “An Index of Relative Deprivation,” Economics 
Letters, 1 (1991): 337-41 
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depression and despair are more likely to appear; and (c) there is no 
hope of changing the situation; inferiority is not temporary and the 
subject is impotent and passive with regard to the possibility of 
changing the situation. The following factors are more dominant in 
small gaps: (a) the subject’s inferiority is often perceived as a greater 
injustice since it is considered as stemming from unjust deeds of 
human beings; (b) the variable of controllability is stronger, that is, 
the subject’s personal responsibility for her failure is more empha- 
sized; hence, frustration, self-blame, and regret are more likely to 
appear; and (c) an alternative is more available and the feeling of a 
missed opportunity is stronger. 

The complexity in comparing the intensity of envy in small and 
large gaps is also represented by the types of available mechanisms 
for defending the agent’s self-esteem. Some mechanisms are more 
effective in small and some in large gaps. The major mechanisms 
available in small gaps are: (a) devaluation of the desired item; (b) 
devaluation of the envied person; and (c) assuming that the subject’s 
inferiority is temporary, either because the subject’s genuine value is 
soon to be revealed or other fundamental factors are about to 
change the situation. Although to a certain extent these mechanisms 
are also available in large gaps, they are less effective there since it is 
harder to devaluate large gaps. A major mechanism, available mainly 
in large gaps, is that in which we do not choose those obviously 
above us as friends and acquaintances.*! This mechanism is less use- 
ful in the case of small gaps, typical among friends and acquain- 
tances. Our friends and acquaintances are people in our immediate 
social and economic environment; precisely these are typical objects 
of envy. One mechanism that seems to be effective in both cases is 
where the subject claims that the object’s superiority is not worth 
the sacrifice it requires. 

The variety of conflicting factors participating in envy, its painful 
nature, and the available defense mechanisms make it difficult to 
determine precisely whether envy is more intense and painful in 
small or large gaps. The following general trends may be suggested, 
however: (a) envy is less frequent in large gaps; and (b) if it does arise 
in such gaps, it tends to be either hardly painful or very painful. 
Envy is less frequent in large gaps for the following major reasons. 
First, in large gaps the two people do not belong to the same refer- 
ence group and hence comparison is harder. Second, there are 
fewer people whom we consider profoundly superior to us. Third, 
since it is harder to cope with envy stemming from the profound 


1 See also Elster, “Envy in Social Life,” manuscript (1991); Montaldi. 
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inferiority typical of large gaps, there is a tendency to transpose 
profound inferiority into minor inferiority typical of small gaps. 
Fourth, in many cases where the subject-object gap is large, envy 
may become admiration.®® The second claim, namely, that if envy 
exists in large gaps it tends to be either hardly or very painful, can 
be explained by taking account of the marginal role that a compara- 
tive framework has in such cases. Since comparative considerations 
are hardly present, envy is similar to wishful thinking and is not 
painful. If we insist on taking comparative considerations into ac- 
count in large gaps, then our situation is very painful since it in- 
volves profound inferiority. In Amadeus, Salieri’s envy toward Mo- 
zart is an example of how painful and destructive envy involving a 
large gap can be. Indeed, envy of natural gifts often turns into 
hatred since in both cases the object’s basic and permanent charac- 
teristics are negatively evaluated. The issue of desert, which is more 
pronounced in great proximity, where there is no apparent reason 
for the subject’s inferior situation, can also play a role in large gaps. 
In this case the complaint will be more general, often directed at 
impersonal circumstances. Salieri does not doubt Mozart’s signifi- 
cant superior standing in music, but he doubts God’s judgment in 
bestowing a great musical talent on such a rude and ugly person as 
Mozart.” Hostile or malicious envy, which is the prototypical kind in 
pathological cases discussed in clinical literature on envy, is more 
common in large gaps. We should not reduce envy to hostility, how- 
ever, or vice versa. Envious people dislike their inferior position; 
they do not necessarily feel hostility or dislike toward the envied 


person. 
v 


I now turn to discuss the sociological aspect of the correlation be- 
tween envy and inequality. Although inequality refers primarily to 


™ Envy may turn into admiration if the object is excluded from our specific 
reference group, namely, when the object's success is not relevant in determining 
our worth. A typical case of admiration involves high relevancy of the specific 
domain and little similarity to the specific person: we usually admire those in our 
domain who are not sunilar or close to us and hence do not threaten our self-es- 
teem. In these cases the object’s superiority is typically percetved to be unalter- 
able. Another attitude in this respect is when we perceive the object's good for- 
tune to increase our esteem. This attitude, which typically involves low relevancy 
of the specific domain and little similarity to the specific person, has been charac- 
terized as basking in reflected glory; it is common, for example, in sport People 
feel that they can share in the glory of a successful other with whom they are in 
some way associated By being associated with someone’s success, the subject's 
social as well as self-image are thought to be enhanced. See R. B. Cialdini, R J. 
Borden, A. Thorne, and L R Sloan, “Basking ın Reflected Glory. Three (Foot- 
ball) Field Studies,” Journal of Personality and Soaal Psychology, XXXIV (1976). 
366-75. 

3 See also R. Smith On the relationship between envy and hate, see my “Anger 
and Hate,” Journal of Socal Philosophy, xxm (1992). 
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gaps in current positions, the other factors of proximity, namely, 
similarity in background and relevancy, also play some role in deter- 
mining the level of inequality. I shall examine the level of envy in 
conditions of reduced social and economic inequality, rather than 
perfect equality, since the latter is an ideal of doubtful feasibility 
(and, some may add, moral grounds); anyway, it is not a goal for the 
foreseeable future. Reducing inequalities is a more feasible goal 
which to some extent has already been realized in several societies. 

The central concern in envy is different from the egalitarian 
moral concern that calls for the reduction or even elimination of 
different inequalities. No doubt, by fulfilling the desire underlying 
envy, namely, gaining what someone else has, I may become equal to 
this person in this respect, but this is not what the egalitarian moral 
concern amounts to. As suggested above, envy is not a moral emo- 
tion. Envy differs from the egalitarian moral concern in at least two 
major ways. First, it involves a partial rather than a general concern: 
the envious person is not concerned with equality as a general value; 
the claim to equality is merely a desire to improve the subject’s 
personal situation and thus does not appear when inequality favors 
the subject. Second, envy also surfaces in cases where the demand 
for equality is unrealizable and has nothing to do with egalitarian 
moral principles; for example, when one envies the beauty or wit of 
another person. The claim that egalitarianism is a central concern in 
envy should be rejected. Although this claim is different from the 
claim that envy is a central concern in egalitarianism or that egalitar- 
tantsm is motivated by envy, these claims should also be rejected. I do 
not discuss these views here, but merely repeat that egalitarianism is 
a general, moral stand in a way that envy is not.™ 

A more complex issue concerns the suggested negative correla- 
tion between reduced inequality and the intensity of envy. This 
correlation holds for most types of inequality; a notable exception is 
equality of need satisfaction. Unlike most other kinds of equality, 
this kind does not express a quantitative identity but a qualitative 
similarity that is not associated with a continuous comparison with 
others. This kind of equality increases our overall satisfaction and is 
therefore more likely to reduce envy. The problem is that this is the 
hardest kind to achieve and it is difficult to determine its presence, 
especially under conditions of scarcity. Concerning other types of 
equality, only perfect equality can significantly reduce envy. (Equal- 
ity in need satisfaction is the closest to perfect equality.) 

It is not easy to provide empirical confirmation to the suggested 

H See Schoeck, chs. 14-5, for references to such views; sce R Young, “Egali- 


tananism and Envy,” Philosophical Studies, Lu (1987) 261-76, for a rejection of 
them. See also G. F. de la Mora, Egalitarian Envy (New York. Paragon, 1987). 
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correlation between envy and inequality. The notion of “emotional 
intensity” is not unitary; it applies to different phenomena, not all of 
which are correlated. Emotional intensity can be analyzed in light of 
the different emotional components, for example, evaluation, cog- 
nition, motivation, and feeling, and in each component intensity is 
expressed in a different manner. Such manners may be divided into 
two major aspects: (a) magnitude (or peak); and (b) duration.* De- 
spite the complexities in defining and measuring emotional inten- 
sity, there are empirical scales for measuring some emotions such as 
love, anger, and jealousy. Measuring the intensity of envy is espe- 
cially complex, since a person’s envy is specific to people and do- 
mains of personal importance. It is also difficult to measure the 
intensity of envy in a certain society, as there are no general behav- 
ioral rules that express the presence and intensity of envy unequivo- 
cally. Nevertheless, there is some empirical evidence suggesting that 
the inclination toward equalization of fortunes is strongest when the 
subject and the object are more or less on equal footing, and that 
envy does not diminish with reduced inequality but often even in- 
creases. One type of such evidence refers to the degree of satisfac- 
tion of Black Americans from different socioeconomic levels. It was 
found that as Blacks move up the economic and social ladder, they 
become leas satisfied and more likely to take part in protest or riot 
activities. There is further evidence supporting the relative depriva- 
tion theory which claims that ‘‘stratified societies can tolerate higher 
inequality than unstratified societies. As people become more en- 
gaged with each other, they have less tolerance for a given level of 
inequality.”** In what follows, I offer a further step in clarifying the 
complexity of the envy-inequality correlation. I do not present re- 
sults of empirical investigations, but analyze the level of envy in one 
of the most egalitarian contemporary societies, the Israeli kibbutz. 
This analysis illustrates the role in the generation of envy of the 
factors discussed above. 

The Israeli kibbutz is by and large an egalitarian society whose 
members spend a great deal of their life together and enjoy equal 
standing with respect to the basic needs of food, health, education, 
and access to culture. Inequality is minimal and is usually repre- 
sented by different quantities of luxury items or in the type of work 


# See N. H. Frijda, A. Ortony, J Sonnemans, and G. Clore, ‘The Complexity 
of Intensity Issues concerning the Structure of Emotion Intensity,” in M. S. 
r ed , Emotion: Review of Personality and Social Psychology, xm (1992): 60— 

my Emotions toward Others. 

arr Yitzhaki, and R. I. Lerman, “Income Stratificanon and Income Inequal- 
ity” (working paper, Department of Economics, The Hebrew University of Jerusa- 
lem, 1989). See also Crosby; Heider, pp. 288-9; Schoeck. 
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done by each member. It is instructive to find out whether the signif- 
icant reduction of inequality in this society has led to a reduction in 
the level of envy. I first indicate characteristics of kibbutz society 
which increase the level of envy. 


1. There is great similarity in the members’ background and current 
situations and there are small differences in the members’ monetary 
resources and ways of life. When easily perceived differences are 
infrequent, smaller differences become easier to perceive and ac- 
quire greater emotional significance. Moreover, when a society is 
not large or flexible enough to permit segmentation into groups, it 
might not be able to maintain a variety of opinions, abilities, and 
fortunes, thereby increasing similarity and hence envy. 

2. Personal encounters are frequent, making it difficult to forget pre- 
vailing envy: the dynamic of “out of sight, out of mind” does not 
exist. (The size of each kibbutz varies from a population of about 
100 to about 1,500, small enough for frequent encounters and 
keeping people informed about the fortunes of each one.) 

3. Since the background and opportunities of each member are more 
or less similar, one can only blame oneself for one’s inferior posi- 
tion. Greater personal responsibility usually intensifies emotions. 

4. Envy has more legitimacy than in other places. There are two major 
reasons for this. (a) Egalitarian ideology justifies condemnation of 
inequality and thus may be perceived as implying some justification 
of envy. Moreover, because of the predominance of the egalitarian 
ideal, people are often preoccupied with equality for its own sake, 
thereby decreasing their readiness to be satisfied with their own 
situation and thus facilitating the generation of envy.” (b) In small 
subject-object differences, the subject feels more entitled to have 
what the object has than in large differences. 

5. There is a stronger and more justified feeling that the other’s gain 
means the subject’s loss. This feeling, which is not a necessary fea- 
ture of envy (it is more so of jealousy), is common in the kibbutz 
where the gross income 1s shared by all members; if one gets more 
than other members, these members are in a certain sense support- 
ing this person. 

6. Envy can have practical results: the society is actually capable of 
demoting envied people from their position. This can be done ci- 
ther by making them less welcome from a social viewpoint, or by not 


approving many of their requests. 


The first three characteristics express the great proximity in the 
kibbutz. The next two characteristics suggest why the element of 
desert is stronger in this society. The last characteristic indicates that 


Y See also H. Frankfurt, “Equality as a Moral Ideal,” in The Importance of What 
We Care About (New York: Cambridge, 1988), p. 135. 
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envy has practical results. All characteristics increase the availability 
of an alternative, namely, the likelihood of reversing the subject’s 
present inferior situation; hence, the intensity of envy increases. 

Some characteristics of kibbutz society may reduce envy, however. 
The major ones are the following. 


1. Kibbutz society 1s less competitive in many areas. Thus, the kibbutz’s 
egalitarian ideology attaches less importance to economic and pro- 
fessional standing 

2 An envious person 1s usually ignorant of many of the object’s quali 
ties, focusing attention only on what is envied. The desired qualities 
receive therefore disproportionate attention from the envious per- 
son. In the kibbutz, this distortion is less likely to occur because 
there is more intimate acquaintance with all qualities, including the 
shortcomings, of the envied person. As suggested above, a broader 
perspective usually decreases emotional intensity. (Of course, the 
more intimate acquaintance may induce further envy as when the 
object is superior to the subject in these areas as well. Indeed, in 
classical antiquity it was assumed that whoever excels in one respect 
excels in all. But since no one is perfect, it is always possible to find 
shortcomings and to assign them disproportional weight.) 

3. An individual’s social circle is not limited to those of a similar 
economic and professional standing—the primary candidates for 
our envy. 

4. Reducing inequalities often leads to an increase in self-satisfaction 
—at least for those in inferior positions, who are the ones who 
experience envy. Since the absence of self-satisfaction is an impor- 
tant component in envy, its increase may lead to a lower level 
of envy. 


The above analysis indicates that even in this small and largely 
homogeneous society, different factors have conflicting influences 
on the level of envy. These conflicting factors make it hard to deter- 
mine the comparative level of envy in each kibbutz with certainty. 
Moreover, personal and contextual factors typical of each particular 
kibbutz would often have to be taken into account in order to deter- 
mine the exact level. Although the main purpose of the above analy- 
sis was to illustrate the presence and nature of the factors underly- 
ing envy, it can also serve to indicate the comparative level of envy in 
this egalitarian society. The major factors raising the intensity of 
envy in kibbutz society are greater proximity (concerning both simi- 
larity in background and closeness in current positions), stronger 
desert claims, and greater availability of an alternative. The major 
factors reducing envy are less competition, broader personal ac- 
quaintance and less profound inferiority. In light of our analysis of 
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envy and emotional intensity, it seems that the first group has 
greater weight; hence, envy is likely to be stronger in this society.” 

A similar analysis can be conducted with regard to modern West- 
ern society in general. I do not intend to carry out such an analysis 
here but to point out that the level of envy seems to increase also in 
this society. The major factors responsible for this are: (a) modern 
Western society is less stable than traditional societies, and instabil- 
ity is a major variable in the rise of emotions;® (b) modern Western 
society is much more competitive than its predecessors; (c) the com- 
munications revolution brings people closer to each other; and (d) 
ideological condemnation of huge social and economic gaps is in- 
creasing—the present hierarchy is no longer accepted as a justified, 
natural phenomenon. There are of course conflicting factors, for 
example, improved living conditions and greater stress on personal 
development. I cannot compare the weight of these and other fac- 
tors here, but again the factors responsible for increased envy seem 
to be more central. 

The presence of a mainly negative correlation between inequali- 
ties and envy does not imply that we should not strive to reduce 
inequalities; reducing inequalities may have positive results of a dif- 
ferent sort. We should be aware of the consequences (vis-a-vis envy) 
of such a development, however. Reducing inequalities may lead (as 
Rawls argues) to a more just society; but (contrary to Rawls) this 
society may witness a rising level of envy. When social and economic 
gaps are large, the chances for rage, hate, frustration, and several 
types of violent reactions are greater. When these gaps narrow, the 
intensification of envy usually follows, along with a reduction in 
these attitudes. If we are to witness a further reduction in social and 
economic inequalities, we should expect the problem of envy to 
become more prominent. Whatever social and moral advantages re- 
duced inequality has, reduced envy is not one of them. 

VI 


What are the possible ways of reducing envy? One obvious way is to 
increase the subject-object distance, namely, to decrease the num- 


™ Kibbutz ideology 1s aimed at equality of need satisfaction; the ideal 1s sup- 
posed to be “from each according to his ability, to each according to his need.” 
As mentioned, this equality ıs the most likely to reduce envy but the hardest to 
achieve. The areas in kibbutz’ hfe which are close to fulfilling this ideal kind of 
equality are food, health, and to a lesser extent education. It seems that in these 
areas, where inequality of need satisfaction is significantly reduced, the level of 
envy a reduced as well. 

In the political realm, changing one’s positon 1s facilitated by the democratic 
structure of modern societies. Isocrates argues that monarchy is preferable to 
democracy since it involves less envy’ the king 1s so far above his subjects as not to 
excite envy. See Walcot, p 30 
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ber of people who are of emotional significance to us. It seems, 
however, that reducing envy is not worth the price of widening the 
subject-object disparity in this manner. The harmful consequences 
would exceed the benefits. Another related way is to reduce the 
availability of an alternative. Again, reducing envy is not worth this 
price of limiting one’s opportunities which may reinforce despair 
and indifference and debilitate hope in human life. 

Envy can be reduced by making social comparison, and hence 
competitiveness, less significant. Two major means for doing it are 
diversifying valued domains and settling for less. The first means is 
that of expanding the range of valued domains in which people can 
perform well compared to others.*° Thus, chances increase that 
more people will enjoy superiority in at least one domain. Since we 
can expect people to consider this domain to be quite relevant in 
assessing their value, we can expect the reduction of envy and com- 
petitiveness. Diversifying valued domains requires, then, two ele- 
ments: (a) adding new types of domains a person can be satisfied 
with; and (b) the realization that each domain has its own intrinsic 
value. Achieving these objectives, which involves stressing the intrin- 
sic value of people (including oneself), increases dissimilarity among 
individuals; hence, it offsets the increased similarity stemming from 
reduced socioeconomic gaps. 

The second means to lessen the significance of social comparison 
is that of settling for less. In this case another person’s success will 
bother us less. As A. Parducci*! remarks, “With lower standards, 
regular accomplishments would be much more satisfying.” Take the 
case of a man who has an exalted picture of the ideal woman in his 
imagination, a standard that no real-life woman could meet. Because 
of such exalted standards, all his romantic relationships are doomed 
to be disappointing. If this man lowers his standards, he can be 
happier. What now appears to him as “the best” may be a real-life 
woman, with all the faults that previously seemed too grievous to 
bear. If we can truly settle for less, happiness is likely to increase and 
envy is likely to decrease since we are more likely to gain what we 
want. Some societies, such as the kibbutz, foster the ideal of being 
content with little, which is similar to the ideal of settling for less. 


* See R. Nozck, Anarchy, State, and Ui (New York: Basic, 1974), R. H 
Smith, E. Diener, and R. Garonzik, ‘The Roles of Outcome Satisfaction and 
Comparison Alternatives in Envy,” Bnitesh Journal of Social Psychology, XXIX 
(1990): 247-55 


Sya oe ts: Toward a Relanonal Theory of Happiness,” in J. R Ewer, 
(New York: Springer, 1984), pp 14-5 for leas 
TE geen recommen Hone Ghee level of success can really be achieved; in 


this case, Parducci argues, a co between achievement and happiness arses. 


ENVY AND INEQUALITY 581 


Both attitudes realize a similar evaluation concerning the limited 
value of the continuous race to win. But whereas the first ideal 
addresses absolute deprivation, the second concerns relative depri- 
vation. Hence, for the purpose of reducing envy, the second ideal is 
more appropriate. 

We may combine the suggested two means to reduce envy by 
speaking about settling for being different. In some cases different 
may be less, but in others it may just refer to a noncomparable 
domain. Adopting these two means represents a kind of maturity of 
the subject (and society) in which social comparisons are less signifi- 
cant in determining the subject’s happiness. Such determination be- 
comes more intrinsic to the subject’s own activity. Other means to 
reduce envy are those effective in reducing emotional intensity in 
general; for example, making one’s perspective less partial. Al- 
though these means are not easy to employ, they have a good chance 
of reducing envy, especially if we realize their role in increasing our 
happiness. 

In summary, I have argued that envy is not a moral emotion and 
that the subject’s inferiority is a central concern in envy. This is not 
the only concern, however, since there is no positive correlation 
between envy and inferiority. Another important concern is that of 
desert. Accordingly, the focus of concern in envy is the subject’s 
undeserved inferiority. This concern does not imply a positive 
correlation between envy and the subject-object inequality, since 
questions of desert are more prominent regarding those who are 
similar and close to us, namely, comparable with ourselves. Deter- 
mining the type of this correlation should take into account whether 
the object is within or outside the subject’s reference group. Envy is 
more frequent within the reference group where the correlation 
between the intensity of envy and subject-object inequality is usually 
positive. Outside the reference group, the correlation is often nega- 
tive since greater inequality makes the subject-object ‘comparison 
less significant. This correlation holds also on the sociological level. 
Accordingly, we can expect that reduced socioeconomic gaps will 
increase envy. Major means for reducing envy are those making 
social comparison less significant. 

AARON BEN-ZE'EV 
University of Haifa 
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THE IDENTITY THEORY OF QUOTATION* 


uppose I want to talk about a bit of language, say, the title of 

this essay. How might I go about doing this? I can give it a 

name. ‘U’ will do. I can then say that U has twenty-eight 
letters, and what I say will be true. Another option available to me is 
to offer a description. I can describe the title in terms of its constitu- 
ent words. I can give the names ‘V’, ‘W’, ‘X’, ‘Y’, and ‘Z’ to the five 
words that compose it. I can then say the expression consisting of V 
followed by W followed by X followed by Y followed by Z has 
twenty-eight letters, and again what I say will be true. Yet another 
option is to demonstrate it. I can say, pointing to the top of the 
page, that that expression has twenty-eight letters, and once again I 
shall say something true. 

Naming, describing, and demonstrating are distinct ways of men- 
tioning the title. Together they exhaust the standard ways of men- 
tioning nonlinguistic objects. But most of us would balk at the idea 
that our options for mentioning language are limited to these three. 
Surely, if I want to mention the title, I can simply quote it. I can talk - 
about the title by saying that “The identity theory of quotation’ has 
twenty-eight letters, and what I say will be true. But in quoting the 
title, am I doing something other than naming it, describing it, or 
demonstrating it? This is the main question which I want to address. 
The answer I suggest is “yes.” I argue that the use of expressions in 
quotation is fundamentally different from nonquotational uses of 
language. 

I 

In natural language, quotation is a way of mentioning things. Its use 
is easy to learn and once learned it is readily applied and under- 
stood. These qualifications make quotation a phenomenon worthy 
of explanation. The fact that quotation, like the modalities and the 
attitudes, gives rise to opacity makes it a problem for the philosophy 
of language. As such problems go, however, quotation would seem 
to be on the easy side. Its workings appear to be a paradigm of 
simplicity: in quotation, it seems that words are used and mentioned 
in the same breath. Moreover, this correlation would appear to be 
explanatory: it seems that words in quotation are mentioned simply 
by being used. 


^ I have received helpful comments from John Perry, Jamie Rucker, Michael 
O'Rourke, Henry Smith, Johanna Carr, and a referee for The Philosophical Re- 


view. 
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In what follows, I shall argue that, where quotation is at issue, the 
appearances are the reality. I shall argue that, in quotation, expres- 
sions are used to mention themselves and that this correlation is 
rule-governed. I call this view the identity theory, which is based on 
the idea that linguistic expressions may be used in different ways. 
According to the theory, a special quotational use of expressions 
underlies the phenomenon of quotation. The quotational use of an 
expression contrasts with what I call its standard use. The paradigm 
of standard use is the use of expressions outside of quotation in 
unembedded contexts. The contrast is illustrated in (1): 


1. ‘Three words long’ is three words long. 


The first use of the expression ‘three words long’ is quotational, 
while the second is standard. The first is interpreted by the principle 
of identity, the second is not. 

Alfred Tarski, W. V. Quine, Peter Geach, and Donald Davidson 
are among those who find the appearances misleading. These 
writers maintain that while quoted expressions may seem to mention 
themselves, the identity is illusory and, in point of fact, the expres- 
sions used are not what is mentioned. They also deny that the princi- 
ple of identity or any related principle plays a role in the interpreta- 
tion of quotation. They recognize only the genera of uses that I call 
standard, and advocate theories that treat quotation as involving a 
standard, though perhaps disguised, use of names, descriptions, or 
demonstratives. The quotation in (1) is thought to be analogous to 
(2), (3), or (4): 


2. A is three words long. 

3. The expression consisting of B followed by C followed by D is three 

words long. 

4. That expression is three words long. 

(Where ‘A’, ‘B’, ‘C’, and ‘D’ are names and ‘that’ is a demonstrative.) 

I call these theories “‘standard.”” A standard theory is one that 
analyzes quotation on the model of a standard use of an expression. 
Below I shall explain how standard theories differ from the identity 
theory and attempt to show why the commitment to standard se- 
mantic concepts makes it impossible to give a correct answer to what 
would seem to be a very simple problem. My choice of the term 
‘standard’, which I contrast with ‘quotation’, is meant to underscore 
the essential difference between quotation and other uses of 


It will help to prevent confusion later on if I take a moment to 
formulate some terminology and draw some distinctions. At the out- 
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set, it is important to distinguish expressions from their tokens. The 
former are types, which are abstract particulars. The latter are to- 
kens, which are concrete particulars. When I say ‘expression’ I mean 
expression type, and when I say ‘token’ I mean expression token. I 
call an expression a “mentioning expression” if it is used to mention 
something. Being a mentioning expression is a property that an 
expression has on a particular use. My use of the verb ‘mentioning’ 
is meant to be neutral between different ways of talking about ob- 
jects. I use ‘semantic value’ as a term for the contribution an expres- 
sion makes to the interpretation of sentences of which it is a part. I 
use ‘object mentioned’ or ‘mentioned object’ as terms for objects 
that are mentioned. Objects mentioned are the semantic values of 
mentioning expressions. When I say that the semantic value of an 
expression has been ‘stipulated’, I mean that the expression ac- 
quired its value as a direct result of the performance of a particular 
act or acts. I do not mean to imply that the acts were performed with 
the intention of giving the expression a value. 

To give a precise analysis of the semantics of quotation, three 
basic questions have to be answered: 


5. What expressions are used and what roles do they play? 

6. What type of object is mentioned? 

7. How is the semantic value of the mentioning expression deter- 

mined? 
I shall let these questions guide my discussion. In the next section, I 
review some well-known standard theories of quotation and the an- 
swers they give to questions (5)-(7). In section ITI, I briefly summa- 
rize the identity theory and the answers it gives to questions (5)-(7). 
Sections IV-VI address questions (57) in detail. I provide reasons 
why they should not be answered in the manner of the standard 
theories and why they should be answered in the manner of the 
identity theory, taking the occasion to go more deeply into details of 
the proposals. 
H 


Question (5) asks us to determine the parts of a quotation, if there 
are any, and assign credit for overall effects where credit is due. 
Given the orthographic structure of standard written quotation, the 
candidates for the mentioning expression are limited to three. The 
mentioning expression could be (a) the expression consisting of the 
marks and what stands between them, which I term the ‘quotation as 
a whole’; (b) the expression between the marks, the ‘quoted expres- 
sion’; or (c) the quotation marks alone. Tarski, Quine, and Geach 
have advocated the view that it is the quotation as a whole which 
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mentions an object, while Davidson is responsible for the novel sug- 
gestion that the quotation marks are complete mentioning expres- 
sions in themselves and that the quoted expression is not used at all. 

A view held by Tarski, Quine, Geach, and many others regarding 
what is mentioned in quotation, question (6), is that quotation is a 
device for talking about linguistically interpreted types. These types 
come in different varieties. On one theory advocated by Tarski and 
Quine, these types are concatenations of letters. Geach’s explicit 
position is that the object mentioned is a concatenation of words. 
Taking a subtly different stance, Davidson holds the objects men- 
tioned by quotations to be shapes.’ Davidson and the advocates of 
the theory of interpreted types are united in maintaining that in 
quotation we do not talk about tokens. Disagreements between Da- 
vidson and type theorists concern the boundaries of the types and 
whether the types are linguistically interpreted; a linguistically inter- 
preted type may encompass many distinct uninterpreted shapes. 

Question (7) asks about how the value of a quotation is fixed. The 
answer must give an account of how it is determined which object a 
given use of quotation mentions. In this connection, Tarski and 
Quine have promoted what has come to be called the “name 
theory.” This is the view that quotations are standard occurrences of 
semantically simple expressions.? As Tarski remarks: 


Quotation-mark names may be treated like single words of a language, 
and thus like syntactically simple expressions. The single constituents 
of these names—quotation marks and the expressions standing be- 
tween them— fulfil [sic] the same function as the letters and complexes 
of successive letters in single words. Hence they can possess no inde- 
pendent meaning. Every quotation-mark name is then a constant indi- 
vidual name of a definite expression (the expression enclosed by quota- 
tion marks) and in fact a name of the same nature as the proper name 
of a man (op. at, p. 159). 


An expression is semantically simple if its standard semantic value is 
not fixed by general principles. Words like ‘and’, ‘car’, and ‘hello’ 
are semantically simple. To say that a quotation is a standard occur- 
rence of a semantically simple expression is to say that its value is not 


' Davidson seems to be influenced by Tarski, who held that quotations refer to 
classes of expressions of like shape See Tarski, “The Concept of Truth in a 
Formalized purei in Logic, Semantics and Metamathematics (New York: Ox- 
ford, 1969), p. 156n. 

? Jonathan Bennett reports that Quine has maintained, in personal aa 
tion, that in his wntings he uses the term ‘name’ as a catchall alas 
sions that are semantically smple—“Quotation,” Nows, xxi (1988): 3004 8 
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given by general principles, that some stipulation must have been 
made for it to have obtained its value. 

Along other lines, Tarski (op. cit., p. 160) and Quine’ have also 
suggested that quotations be analyzed as standard uses of collapsed 
descriptions, which describe expressions by their spelling. This 
theory, which has been called the “spelling theory,” treats the quo- 
tation ‘hello’ as a short form of the description “the expression got 
by writing ‘h’ followed by ‘e’ followed by ‘l’ followed by ‘l’ followed 
by ‘o’.” Another version of the description theory, given by Geach,* 
is identical except that the basic parts of descriptions mention indi- 
vidual words rather than letters. He writes: “I maintain that the 
quotation ‘man is mortal’ is rightly understood only if we read it as 
meaning the same thing as “ ‘man’ & ‘is’ & ‘mortal’,” i.e. read it as 
describing the quoted expression in terms of the expressions it con- 
tains and their order” (tbid., pp. 82-3). I call this the “lexical” de- 
scription theory. It is important to note that neither the spelling nor 
the lexical theory gives an account of the semantics of the basic parts 
of the descriptions. These theories treat the parts as semantically 
simple, a subtlety which is obscured by the use of quoted letters or 
quoted words. 

The range of views is represented in the following chart: 


semantic mentioning method of 

Proponent category expression fixing mentioned object 

T/Q name* quotation as stipulation expression 
a whole 

T/Q description quotation as stipulation for series of letters 
a whole parts 

Geach description quotation as stipulation for series of words or 
a whole parts umambiguous 

expression 

Davidson demonstrative quotation demonstration shape 
marks 

Frege anything quoted principle of expression 
expression identity 

Searle anything quoted princtple of expression 
expression identity 

CGW anything quoted principle of expression 
expression idenuty 

CHART A 
3 Word and Object (Cambridge: MIT, 1964), p. 212. 


1 Mental Acts (New York: Routledge, 1957). 
* See fn. 2. 
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I 
With regard to the issues raised by question (5), the identity theory 
diverges from Tarski, Quine, Geach, and Davidson, while agreeing 
with John Searle and Gottlob Frege. In “On Sense and Meaning,” 
Frege® writes: 


If words are used in the ordinary way, what one intends to speak of is 
what they mean. It can also happen, however, that one wishes to talk 
about the words themselves or their sense. This happens, for instance, 
when the words of another are quoted. One’s own words then first 
designate words of the other speaker, and only the latter have their 
usual meaning. We then have signs of signs. In writing, the words are in 
this case enclosed in quotation marks. Accordmgly, a word standing 
between quotation marks must not be taken as having its ordinary 
meaning (tbid., p. 201). 


In this passage, Frege is primarily concerned with direct and indi- 
rect discourse reports. His comments about direct discourse also 
apply, however, to uses of quotation where words are mentioned 
and discussed but not attributed to a person as something said. 
Frege’s view is that the use of an expression on the interior of a set 
of quotation marks is special. Quotation marks, which are most 
common in written speech, signal this new use. John Searle’ puts the 
point succinctly when he writes that in quotation “‘a word is uttered 

. but not in its normal use. The word itself is presented and then 
talked about, and that it is to be taken as presented rather than used 
conventionally to refer is indicated by the quotes” (ibid., p. 76; my 
emphasis). The quotation as a whole is analyzed into the marks that 
signify quotational use of the quoted expression and the quoted 
expression that is itself used to mention an object. All expressions, 
even those whose standard uses are not as mentioning expressions, 
become mentioning expressions in quotation. 

As regards the question of what quotation mentions, raised in (6), 
I maintain that the most common use of quotation is to mention 
linguistic types. These types may be interpreted or uninterpreted, 
like ‘man’ or ‘ggrmph’; they may be structured or unstructured, like 
‘this man’ or ‘man’. If uninterpreted, however, they must at least be 
potential bearers of linguistic characteristics. In reply to question 
(7), I hold that a quoted expression is related to its value by identity: 
a quoted expression mentions itself. The identity theory’s distinction 
between standard and quotation uses can be represented schemati- 
cally as follows: 


®In The Philosophy of Language, Martinich,ed. (New York. Oxford, 1985) 
7 Speech Acts (New York Cambridge, 1989). 
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semantic quotational use 
value ———_ ©) 
a A a 
token expression ————-—--— > semantic value 
standard use 
CHART B 


A solid line marks the fact that the relation is determined by general 
principles. I use a dotted line to connect the expression with its 
standard semantic value to indicate that the connection is deter- 
mined by stipulation. Stipulations are not general. Since a stipula- 
tion may or may not have been carried out, a basic expression may 
or may not have a standard semantic value. What is commonly called 
the “lexicon” of a language consists of those basic expressions 
which have standard semantic values. 
IV 


What are the parts of a quotation and what role do they play? An 
important fact about quotation which seems to have been over- 
looked by those who say that the quotation marks either are the 
mentioning expression or form an essential part of the mentioning 
expression is that quotation marks may be absent. In conversation, 
oral promptings (“‘quote-unquote’’) or finger-dance quotes can of- 
ten be omitted without impairing the intelligibility or well-formed- 
ness of the utterance. When I introduce myself, I do not say “my 
name is quote-unquote Corey,” nor do I make little finger gestures 
or even use different intonation in order to show that it is my name 
and not myself that is being talked about. Linguists do not say 
“quote-unquote flying planes can be dangerous is ambiguous.” Phi- 
losophers do not say “the belief report quote-unquote ....” In 
general, when it is clear that the subject matter is linguistic, we 
simply utter the expression without marking it in any special way. 
These mundane facts show that Tarski’s, Quine’s, and Geach’s 
proposals do not apply to the vast majority of spoken uses of quota- 
tion. These writers claim that the quoted expression is a mere acci- 
dent in the course of the spelling of a longer word, which contains in 
addition the two quotation marks.* It is this longer word, the quota- 
tion as a whole, which is the mentioning expression. But when quo- 
tation marks are lacking, the quotation as a whole does not occur. 


* Quine, “Reference and Modality,” in From a Logical Posni of View (New 
York: Harper & Row, 1961), p. 140. 
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So, on this proposal, most quotations do not exist—a conclusion 
that is difficult to accept. 

Davidson’s® thesis is vulnerable to the same counterarguments. 
He conceives of quotation marks as referential expressions, roughly 
synonymous with the locution, ‘the shape of that’: 


On my theory which we may call the demonstrative theory of quota- 
tion, the inscription inside does not refer to anything at all, nor is it 
part of any expression that does. Rather it is the quotation marks that 
do all the referring, and they help to refer to a shape by pointing out 
something that has it (ihd., p 90). 


On Davidson’s account, the subject of the sentence in (8) is the 
quotation marks: 


8. ‘Hello’ is a salutation. 

That is, (8) appears as: 
8’. ‘’ is a salutation. 

And when uttered without quotation gestures as: 
8”. is a salutation.!° 


Davidson’s theory does not provide for a subject to replace the 
quotation marks when quotation gestures do not accompany the 
utterance.!! Unless some other element can be found to serve as the 
subject of the sentence, the theory makes the manifestly false pre- 
diction that spoken uses of (8) are ill-formed. 

It has been suggested in defense of Davidson’s proposal that the 
quotation marks in an utterance of (8) are implicit.? These implicit 
quotation marks would then serve as the subject in (8°), saving the 
structure from ill-formedness. There are circumstances in which a 
sentence may be perfectly well-formed and yet lack an explicit ex- 
pression in a subject position. In general, imperatives may go sub- 
jectless, as may declarative sentences in languages such as Spanish 
and Italian (“pro-drop languages”). Contemporary linguistic the- 
ories commonly postulate implicit expressions in place of their miss- 
ing explicit counterparts to explain the syntactic and semantic prop- 


? “Quotation,” ın Truth and Interpretation (New York: Orford, 1984). 

E eA RES theory of indirect dis- 
course 1s given by Gabriel in “A Preference for Sense and Reference,” this 
JOURNAL, LXXXVII, 2 feared 1991) 73-89. 

1! Indeed, it is difficult to see how gestures, even when present, could serve in a 
syntactic capacity in spoken or written language. 

1? This is a response which I have often heard. I cannot recall all of the people 
who have suggested it. Stephen Neale is one 
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erties of these constructions. Nevertheless, these theories are care- 
ful to place tight restrictions on the contexts in which implicit 
elements may occur. These restrictions exclude the situation in (8%. 
The predicate in that sentence cannot license an empty element. 
Current theory provides no support for the claim that (8%) contains 
an implicit subject.'® In the absence of further evidence for the 
existence of an implicit subject in (8°), Davidson’s suggestion that 
quotation marks are mentioning expressions must be rejected. 

The facts about written quotation are similar. In writing, we can 
also dispense with quotation marks. We can indicate that an expres- 
sion is being quoted by writing it on a separate line or in italics. One 
must be pressed very hard to adopt a Quinean point of view and see 
the italicized form of the quoted expression or the quoted- 
expression-on-another-line as distinct from the quoted expression, 
as if italicization or setting an expression off created a new expres- 
sion. There is even less reason to attempt to persevere in Davidson’s 
way of looking at things and reify the italicization or the being-on- 
another-line of the quoted token into a mentioning expression. 

Yet writing does differ from speech in some important respects. 
Unlike spoken quotation, it appears that some signal is required to 
indicate that the expression is to be quoted and not used as usual. 
Though standard in speech, (9) is simply not acceptable written 
language.!4 

9. Hello is a salutation. 


The fact that (9) is ill-formed should not be taken as evidence that 
quotation marks form an essential syntactic or morphological part 
of the sentence, and hence as evidence for a position like Quine’s or 
Davidson’s. Causes of ill-formedness are varied. Written quotations 
without quotation marks are simply incorrectly punctuated and are 
deviant in the same way that other incorrectly punctuated expres- 
sions are deviant: 


10. Did you eat your peas. 
11. John who went to school liked to stay up late. 


There are a number of ways of explaining how punctuation can 
effect syntactic well-formedness without supposing punctuation 


13 Tt should be clear that any thought of salvaging Rume a theory by supposing 
empty elements (empty letters, in this case) in place of the missing quotation 
marks should be met with at least as much skepticism as it is in the case of 
Davidson's theory. 

4 Sometumes something like this is written: 

He 1s called John. 
But here the verb ‘called’ signals that a quotation is to follow. 
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marks to be words or parts thereof. A view I find attractive is that in 
common written language punctuation marks play a role analogous 
to the role phrase-structure markers play in linguistics. Just as 
phrase-structure markers signal that a linguistic element has a cer- 
tain structure or belongs to a certain category, so too do punctua- 
tion marks signal that an expression has a certain linguistic charac- 
ter. Neither are grammatical constituents of the expressions they 
accompany. If a sentence is followed by an exclamation point, then 
it is a command or an exclamation; a comma might be a sign of a 
break between a series of constituents; an expression flanked by 
quotation marks is a quotation.’ The absence of the appropriate 
marks would signal that the expression lacks the characteristics in 
question. (This is necessary to ensure that expressions without punc- 
tuation are not systematically ambiguous between punctuated and 
unpunctuated readings.) The sentence in (9) would be ill-formed 
because the word ‘hello’ would have its normal use as an interjection 
in a context where a noun phrase is required. 

I believe the simplest and most natural way to explain these facts 
is to take the quoted expression to be the mentioning expression. 
Quotation marks, on this account, are neither mentioning expres- 
sions nor parts of mentioning expressions. Quotation marks are 
punctuation. As punctuation, they signal that the quoted expression 
has a special use.'® They take their place alongside the period, semi- 
colon, question mark, and comma.” The differences between writ- 
ten and spoken quotation can be attributed to the more formal 
character of written expression. 

Although this account strikes me as the most plausible, it is not 
the only one that is compatible with the facts. We have seen that 
there are cases in which quotation marks are present and cases in 
which they are not; spoken quotation can occur with or without 


15 For a detailed discussion of the syntax and semantics of punctuation, see 
Jeffrey Nunberg, The Linguistics of Punctuatson (Stanford: CSLI, 1990). 

'© Quotation is sometimes described as a linguistic context in which words take 
on a different significance This way of is not entirely accurate, since 
contexts are typically marked grammatically, but the grammatical signs of quota- 
tion (the marks) are often absent. 

17 Sanskrit exhibits a quotative marker (it) which is placed after the quoted 
material; see J S. Speijer, Sanskrit Syntax (Leiden: Brill, 1886), pp. 87 : 

‘ham iti vadati 

Kmg-NOM I-NOM QUOTATIVE-MARKER he-says 

He says “I am the king” 

(I am endebted to Henry Smith for this pomt.) 

According to the Encyclopedia Britannica, quotation marks were introduced in 
the West along with the exclamation point around 1550. Prior to this time, their 
work was done by contextual factors. 
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quotation marks, though marks are usually not present; direct quo- 
tation in writing may occur with or without marks, though the use 
must be indicated in some fashion. We could say that there are two 
sets of mentioning expressions which can be used in quotation, one 
consisting of expressions with quotation marks (the quotations as a 
whole) and another consisting of expressions without quotation 
marks (the quoted expressions). There will be: ‘hello’ and ‘ ‘hello’ ’, 
‘goodbye’ and ‘ ‘goodbye’ ’, ‘stop’ and ‘ ‘stop’’, and so on. Coun- 
terparts in each set will always mention the same object. Although 
this proposal accords with the facts, it is still undesirable insofar as it 
leaves unexplained the fact that expressions with quotation marks 
always mention the same thing as expressions without quotation 
marks. Another possible explanation is that the quoted expression 
mentions the object and quotation marks are functional expres- 
sions. This would explain the symmetry between expressions with 
and without quotation marks. The idea of functionless functional 
expressions goes down hard, however. The fact that these purported 
functions can be left off when context requires indicates, I think, 
they are more like context markers (punctuation) than like func- 
tions. Given that neither of these proposals is satisfying, I shall stick 
to the view that the quoted expression mentions the object, while 
the quotation marks are punctuation. 

Before I leave this topic, I want to examine an argument sug- 
gested by Davidson’s article. The argument is designed to show that 
quoted expressions cannot be parts of quoting sentences. If the 
argument is sound, then we must give up our contention that the 
quoted expression is the mentioning expression.’® The basis of the 
argument is Davidson’s conception of the finiteness constraints on 
learnable languages. “In an adequate theory,” Davidson writes, 
“every sentence is construed as owing its truth or falsity to how it is 
built from a finite stock of parts by repeated application of a finite 
number of modes of combination” (op. cit., p. 83). In other words, 
for the interpretation of a language to be finite, only a finite num- 
ber of elements may occur in its sentences and the semantics must 
be determined by a finite number of finitely statable principles. This 
condition reflects assumptions about learnability. The number of 
basic expressions is limited because a speaker could not learn the 
semantic values of an infinite number of expressions. Sentences of a 
learnable language cannot contain an infinite number of expres- 
sions whose semantic values have been stipulated. Admitting the 


'* This argument is closely connected with the argument Davidson gives against 
the name theory; op. cit., p. 83. 
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quoted expression into the quoting sentence, so the argument goes, 
would lead to a violation of this principle. 

To see how quotation is supposed to run afoul of the principle, we 
need only consider definitions. There is a well-established practice 
of using quotation to introduce expressions in definitions. (12) is 
typical: 

12. “Q means alternatively. 

The argument assumes that semantic values of basic expressions 
which can occur in sentences of a language must have been given by 
stipulation. The quoted expression in (12) has a semantic value 
(something like a shape), which determines the content of the defini- 
tion, what the definition is a definition of. The expression is also 
clearly simple. The sheer variety of expressions that can be used in 
this way makes it impossible for such expressions to have been con- 
structed from a finite number of more basic parts. Therefore, the 
expression in (12) and others like it must have had their values 
stipulated. But the number of expressions which can be used in this 
way is infinite. Admitting.such expressions into defining sentences 
will violate the Davidson’s finiteness constraint. 

I cannot say that Davidson offers a solution to this problem, since 
he does not state the argument. One aspect of his theory, however, 
looks very much like a response to difficulties raised by the argu- 
ment. Davidson excludes quoted expressions from quoting sen- 
tences. He admits into sentences only expressions which occur out- 
side of quotation, expressions which have already been defined. Be- 
cause only a finite number of definitions will have been introduced 
at any given time, the number of expreasions able to occur in sen- 
tences: will be finite, at any given time. Finiteness is preserved. 

I think that such drastic measures are unnecessary and that the 
argument that motivates them is unsound. The weak point is the 
assumption that the semantic value of a semantically primitive ex- 
pression must have been stipulated. This assumption rests on the 
view that that stipulation is the only way in which a semantically 
simple expression can acquire a semantic value, something which is 
true if all uses of expressions are standard uses, but is not true if, as 
the identity theory maintains, the semantic values of expressions in 
quotation are determined by a general principle. In section VI, I 
give arguments for why I think that the identity theory offers a 
correct account of how quoted expressions get their values. ° 


19 At this point, I am already m a position to note that the identty theory saves 
the argument from infinite regress. The argument is based on the case of the 
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Once it is granted that the values of expressions in quotation are 
determined by the principle of identity, there is no longer a need to 
limit their number. The principle of identity, being single and fi- 
nitely stateable, offers a finite interpretation for an infinite number 
of expressions. The constraints on finiteness only serve to limit the 
number of expressions in the lexicon, the set of expressions that can 
occur in sentences outside of quotation. The upshot is that finiteness 
constraints provide no grounds for excluding quoted material from 
the sentence. Therefore, they give us no reason not to take the 
quoted expression to be the mentioning expression. 

v 


What type of object is mentioned in quotation? I believe that quota- 
tions may mention any type of expression which can be used. These 
expressions may be words or phrases. They may have a definite set 
of linguistic characteristics or they may be ambiguous. Quotation 
may also be used to discuss words and phrases which lack meaning, 
but which seem to have syntactic properties, as one finds, for exam- 
ple, in Lewis Carroll’s “Jabberwocky.” It may even be used to talk 
about expressions utterly devoid of linguistic characteristics. By 
stretching the device to its limit, one can perhaps talk about tokens 
or shapes. 

This position contrasts with the views of all four writers both in its 
plurality and its specifics. These writers share a liking for uniformity. 
Davidson’s position is distinctive in its austerity: in all of its uses, 
quotation mentions shapes. One of Geach’s arguments suggests a 
position that is distinctive on the other extreme. It seems to imply 
that quotations always mention expressions with definite and articu- 
late linguistic characteristics. Two other answers see all quotations 
as concatenations. Tarski and Quine’s spelling theory implies that 
quotations mention strings of letters, while Geach’s lexical theory 
implies that quotations mention strings of words. In this section, I 
attempt to show that the claim that all quotations mention a single 
type of object is theoretically unmotivated and factually untenable. 


introduction of new expressions through definition. Defmitions, however, are 
ee but stipulations. (According to the identity theory, what they stipulate are 
standard semantic values.) If the new expressions which occur in definitions al- 
ra hepa stipulated semantic values, which they contribute to defining sentences 
t assumes), we should wonder when eine ser epee 
formed. assumption that the quoted expression in a defimition its 
value through definition leads directly to regress. On the identity theory, no 
regress occurs because no prior definition is necessary. Only the standard value is 
given by stipulation; the quotational value of new expressions is determined by 
the principle of identity. 
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Let us consider Davidson’s theory. He claims that quotations 
mention shapes, but does not tell us what shapes are. We may pre- 
sume, however, that shape would be the sort of thing that the in- 
scriptions 


3066 oO 6 OP 


have in common. It is something that they would have in common 
whether written in pencil, pen, or printed as they are. In the case of 
spoken tokens, shape is a metaphor for phonological form, a prop- 
erty shared by utterances of increasing volume but identical diction 
and accent. 

Davidson’s demonstrative analysis (op. ctt., p. 91) would give the 
sentence in (15) the gloss in (14): 


. ‘Snow is White’ is a sentence. 
a Snow is White. The expression with the shape of that token is a 
sentence. 


In the analyzed version, the quoted expression is set off from the 
quoting sentence to underscore Davidson’s contention that the 
quoted expression does not belong to the sentence as a grammatical 
constituent. Its occurrence within the sentence token is a reflection 
of nothing more than stylistic convention. In (14) the italicized 
‘that’ refers to the inscription ‘Snow is White’ as a demonstrative 
pronoun might refer on other occasions to objects salient in the 
context. 

While not clearly mistaken, Davidson’s commitment to shapes in- 
troduces what appear to be unnecessary complications. In his analy- 
sis, shapes enter into the truth conditions of sentences containing 
quotations only to give way to expressions that have that shape. It is 
not difficult to see why this is necessary. The same linguistic charac- 
teristics accrue to two tokens if the tokens belong to the same ex- 
pression type. What is said in a use of a sentence such as (15) de- 
pends not on the shape of the token but on the expression: 


15. ‘SNOW IS WHITE’ is a sentence. 

15’. ‘snOw is wHiTe’ is a sentence. 

15’. ‘Snow is White’ is a sentence. 
These sentences make the same claim because the tokens are in- 
stances of the same expression. That the quoted tokens differ in 
shape is immaterial from the semantic point of view. 


™ Davidson's account of direct discourse 1s based on his account of quotation. 
According to hum, a direct quotation of the form in (a) should be expanded as (b) 
p 92: 
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Davidson’s preference for shapes is especially puzzling given that 
the content of what is said in the examples he uses never depends on 
the shape of the quoted token. Why then does he not say that quota- 
tions ordinarily mention expressions, when this would more accu- 
rately reflect the content of his proposal, not to speak of the true 
state of affairs in quotation? 

Davidson does not explicitly defend his claim, but some consider- 
ations can be found in its favor. One is generality. Everything that 
can be embedded in quotation marks has a shape. This shape will 
always be there to be mentioned. Although it is a slight abuse of the 
device, one might argue that (16) would not be intelligible if quota- 
tion mentioned expressions, because a circle over perpendicular 
lines is not an expression: 


16. “$ is symmetrical. 





a X sad “P” 

b X sad, using words of which ¢thzs is a token, that P 
In verbatim reports of another’s speech, the quoted expression two roles. 
First, the words are ascribed to the speaker as something utt en they are 
ascribed to the speaker as the content of what was said The theory predicts that 
the direct discourse report in (c) should be rendered as (d): 

c. Hume said to his companions, “you must tell Mr. Adams to keep stro 

chairs for heavy philosophers” (after picking himself up off the bat); jokin 

Burton Hill, The Life and Correspondence of David Hume (New York: Garland, 

1983), p. 440 

d. Hume said to his companions, using words with the shape of this token, that 

you must tell Mr. Adams to keep stronger chairs for heavy philosophers. 

But this is incorrect. In general, what matters to quotation is the expression and 
POPI PARETE ee rs ee ee 
(a) will be true if only if we utter the same words that Hume uttered’ David- 
son’s truth conditions, requiring as they do sameness of shape, are too restrictive. 
Indeed, on his is, it is hard to see how a sentence hke (c) could possibly be 
true. If sameness of shape were what mattered, then repo: writing m 
or speech in wnting would be an impossibihty—on no Aa b ot 
shape do written and uttered tokens have the same shape. Correct reporting in 
the same medium would not be ruled out but would require skills of impersona- 


tion or fo 

indexical cemo dei tation also pose problems for Davidson’s ac- 
count. As analyzed by Davidson, Hume would be reported as directing his request 
to my audience rather than bis own and his request would to the t 
rather than the past. This is not what Hume succeeded in do: "Although T an 
socio AEA ES SE EAE ress the condi- 
tions of direct quotation in terms of indirect quotation as follows: 

‘X said “P” ’ as uttered by me now is true iff X said, using words of which ‘P’ is 
a token, that Q. 

where ‘Q’ and ‘P’ stand for sentences and ‘Q’ as uttered by me now expresses the 
content of ‘P’ as uttered by X. 

(The fact that indexicals pose a problem for Davidson’s account was pointed out 
to me by John Perry.) 
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That is to say, one might believe the only expressions there are are 
meaningful ones. On the shape theory, the problem of a lack of an 
available object would not arise, since the symbol above obviously 
has a shape. Similarly, if per chance one did want to distinguish 
among the quotations in (15), one might think that it would be 
possible to do so if quotations mention shapes, but not if they men- 
tion expressions. The thought is that shapes cut finer distinctions 
than expressions. If we could give correct truth conditions for all 
quoting sentences in terms of shapes, then shapes might form the 


basis of an adequate analysis. 
The problem is that it does not seem possible to explain the truth 


of all conceivable uses of quotation using shapes. For if we are 
willing to accept the likes of (16), I cannot see how we could oppose 


17. £D ’ is large, but P is not large. 
Such a use could not be explained on the assumption that quota- 
tions mention shapes, since both occurrences of the expression have 
the same shape. If the search is for the greatest generality, shapes 
will not do. The search would inevitably lead to the conclusion that 
quotations mention tokens, or better yet tokenings, objects that are 
always present in quotation and allow for the finest distinctions. 

The shape and token proposals rest on a number of questionable 
assumptions. One is that the uses of quotation in (16) and (17) are 
worthy of explanation; another is that quotations mention the same 
type of object in all cases. I shall not take a position on the first, but 
note that these uses are, at the very least, highly unusual.”! The 
second, however, is clearly dubious, and as a general policy it should 
be resisted. If, for example, we assumed that reference to material 
objects is reference to a single type of entity, we would have to say 
that whenever we refer, we refer to space-time slices, since four- 
dimensional volume is the one property that all material objects 
have in common. The absurdity of this view reflects badly on its 
analogue in the case of quotation. Furthermore, I see no reason why 
we should be committed, as this line of reasoning seems to be, to a 
rigid concept of expression which requires expressions to be mean- 
ingful and which precludes any distinction between the quotations 


21 Hans Reichenbach, in Introduction to Symbolic Logic (New York: MacMillan, 
1947), mtroduces special tation marks, which he calls “token quote-marks,”’ 
which allow a speaker to reler tony object standing becween them, oken quotes 
turn the wntten page mto a relief map that may contam, in addition to words and 
punctuaton, sand, home appliances, or real estate, indeed anything a person 
might want to talk about. For i Had Bi I need only note that Reichenbach 
introduces special symbols to handle token mentioning quotations, rather than 
treat it as a special use of the familiar device 
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in (16). The notion of expression is flexible, and can be applied 
widely, making coarser or finer distinctions. 

A second plausible motivation for the shape theory is related to 
Davidson’s decision, discussed above, to exclude quoted material 
from the quoting sentence. He makes the connection in the follow- 
ing remark: 


... what I propose is that those words within quotation marks are not, 
from a semantical point of view, part of the sentence at all. It is in fact 
confusing to speak of them as words. What appears in quotation marks 
is an inscription, not a shape, and what we need it for is to refer to its 
shape (op. at., p. 90) 


Davidson does not explain these remarks, but it seems reasonable to 
think that his belief that quoted expressions are not part of quoting 
sentences led him to conclude that quoted expressions are not 
words; where by ‘words’, I suppose, he means interpreted linguistic 
expressions. The general idea would be that tokens that are not used 
do not possess linguistic characteristics. Quoted tokens would there- 
fore be mere inscriptions. Since inscriptions have no linguistically 
relevant properties aside from their shape, it follows that, if quota- 
tions mention what stands between the marks, quotations must refer 
to either the inscription itself or its shape. We should note that while 
Davidson’s stated position is that quotations mention shapes, his 
treatment is compatible with the assumption that they mention 
tokens. 

This argument assumes that quoted expressions are not part of 
quoting sentences. We have observed that, if the identity theory is 
adopted, there is no reason to exclude the quoted material from the 
quoting sentence. With the material readmitted to the sentence as 
an expression, it can serve as the semantic value of the quotation. 
Provided that the value-fixing mechanism of the identity theory can 
be defended, it follows that the second reason for saying that quota- 
tions mention shapes is no good. 

Let us now turn to the description theories, which are another 
source of uniformity. On the spelling theory, each multiletter quo- 
tation is short for a description that describes an expression in terms 
of its constituent letters. The nature of the descriptions implies that 
all quotations mention series of letters. The main problem with this 
consequence is that it cannot be reconciled with conceptions of 
syntax on which the syntactic resources of a language are more 
advanced than concatenation. Most current syntactic theories recog- 
nize constituents and subconstituents, whose relationships cannot 
be represented in the linear ordering of letters. If linear ordering 
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were all there is to the values of quotations, then most of what 
linguistics say about expressions could be prejudged to be false. 
The lexical description theory has consequences that are just as in- 
tolerable. 

It is interesting to note that the considerations Geach cites in 
favor of his theory actually support the idea that quotations mention 
expressions whose structures are not linear. The relevant comments 
come in the following passage: 


a quoted series of expressions is always a series of quoted expressions; 
the quotes around a complex expression are to be read as applying to 
each syntactically distinct part of the expression; and if we fail so to 
read them, because we cannot discern the syntactical parts of the 
quoted expression, then we likewise fail to understand the quotation— 
we do not know what it is a quotation of (op. cit., p. 82).™ 


In line with his official position, Geach emphasizes the serial order 
of constituent expressions. Yet if he is concerned, as he says he is, 
with the syntactic parts of sentences, he must be prepared to recog- 
nize syntactic relations that are more articulate than concatenation 
and constituents that are larger than words. If quotes apply to each 
syntactically distinct part of an expression, they must apply to noun 
phrases as well as the nouns they contain, verb phrases as well as the 
verbs they contain, etc. But this means that quotations must abbre- 
viate descriptions of entire syntactic structures. Given that quota- 
tions must correspond word for word with the descriptions they 
_ abbreviate, it follows that they must describe single syntactic struc- 
tures. For in order for a quotation to be able to mention more than 
one structure, it must have constituents that can refer to the same 
word on more than one occasion. But a single syntactically deter- 
minate expression is an unambiguous expression. Geach’s argument 
suggests, therefore, that quotations mention syntactically unam- 
biguous expressions. 

This high degree of specificity has unfortunate consequences. The 
fact that I say something true when I quote the expression in (18) 
and say of it that it is ambiguous shows that quotations need not 
mention syntactically unambiguous expressions. 

18. Flying planes can be dangerous. 

Geach suggests at other points in his discussion that quotations 
mention words bound to particular languages (op. cit., p. 86). If this 


™ This passage is slightly ambiguous, since ıt is not clear whether quotations are 
“of” what is used or what is mentioned. But this ambiguity will not prevent us 
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were 80, then it would not be possible to mention an expression 
common to distinct languages. The fact that I say something true 
when I quote the expression in (19) and say of it that it is a noun in 
English and a preposition in French shows, however, that quotation 
can mention language-neutral expressions. 


19. Car? 


Since quotation can always be used to mention the whole range of 
unambiguous expressions, these facts suggest the quotation can be 
used to mention any type of linguistic expression.** Of course, this is 


* J assume that semantics is determined at surface structure. If the mterpreta- 
tion is determined at deep structure and (21) is derived by a “conjunction reduc- 
tion” transformation from 21’, then the semantics could assign distinct references 
to the different occurrences of “car”: 


21’. ‘Car’ is a preposition in French and ‘car’ is a noun m English 
There are no good arguments for this hypothesis, however. 


™ Up to this time, I have spoken freely of types, but it should be clear that they 
are nothing more than a device to aid in facilitating generalizations, a benefit 
which can be discarded at any time. Indeed, it is a reasonable question to ask what 
an interpreted or uninterpreted type 1s. Sacra ress E hep aS tans Re 
Ce ee eee ene eee those types (For a discussion of 
the significance of the role of types in linguistic theory, see Sylvain Bromberger’s 
and Tokens in istics,” CSLI-report 88-0125.) 
us consider what I have called “uninterpreted ressions ” I have dis- 
assed dhe use of aici exprrestions'yd-Gehnitiota ike (1? I might have carried 
out this definition to the same effect using a different token, such as the one in 
(12): 
12’. ‘$ means alternatroely. 
ea eae ee 


tics. Follo Bennett, I call characteristics of a token which matter to interpre- 
tation the t ’s “linguistically relevant characteristics.” For the of 
language use, there is no ch between tokens with the same isti 


relevant characteristics. When we sit arte Vee emer aaracterbtend 
acteristics will be some subset of token’s physical characteristics. For written 
tokens, these will mclude form and perhaps size, but not weight or material com- 
position. For spoken tokens, accent will be important, but volume rarely, if at all. 
When a token is used in a definition, it is an example that serves as a paradigm for 
future uses. (It is important to underscore that this is true only of expressions that 
contain no linguis interpreted parts. For example, if the new expression 
contains letters, then letters will not be characterized by the form of the 
letter tokens that are produced in the definition.) 

When it comes to mterpreted expressions, the situation 1s more involved Since 
tokens that look or sound alike may have different linguistic properties, the lin- 
guistically relevant properties of interpreted tokens are not limited to physical 
properties. In addinon, the intentions of the speaker are also relevant. Through 
these mtentions, the token 1s associated with a certain standard usage. This stan- 
dard usage determines its standard semantic value The speaker intends to use the 
token as others have used similar tokens m the past. A speaker will not be inter- 
ested in associating his use with just any previous uses. The only uses which matter 
are those connected to the current use by a chain of communication Uses com- 
pletely cut off from the current one will not be of interest (The picture 1s implicat 
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not surprising. The variety of mentioned expressions simply reflects 
the variety of used expressions; a quoted expression mentions itself. 
VI 


How is the semantic value of the mentioning expression deter- 
mined? We have decided that in quotation the expression used and 
the expression mentioned are one and the same. The question that 
remains is how the expression is associated with itself. The name, 
description, and identity theories all offer different answers to this 
question. 

The name theory disregards all appearances of structure. It sees 
all quotations, whether of a word, a sentence, or an entire discourse, 
as single semantically simple expressions. The description theories, 
which recognize structure where the name theory does not, select 
quotations of words or letters as semantically simple and builds 
complex quotations from them. What the theories have in common 
is their reliance on standard semantics and semantically simple ex- 
pressions, though their selections of simple expressions differ. The 
theories recognize only the standard use of simple expressions. 
Now, the defining characteristic of a semantically simple expression 
is that its standard semantic value is determined by stipulation. I 
shall argue that this feature of simple expressions is incompatible 
with the facts of quotation. Neither whole quotations nor their parts 
can receive their semantic values by stipulation. By approaching the 
issue from different though related perspectives, I hope to highlight 
the consequences of the differences between the name and descrip- 
tion theories, on the one hand, and the identity theory, on the other. 





in Saul Kri and Gareth Evans, Namng and Necessity (Cambridge’ Harvard, 
1980) and ‘ Causal Theory of Names,” in The Philasophy of Language. Phys- 
cal form enters the picture msofar as the attributes of a token cannot deviate 
arbitrarily far from some paradigm and sull be identified with other tokens. 

Thus picture applies not only to names and natural-kind terms, but to all basic 
expressions. Even the use of a demonstrative involves an association with a net- 
Woni GF past usage: This alco pil thane Detorn ‘that’ a use of a demon- 
strative. The difference between indexicals, like pronouns and demonstratives, 
and nonindexicals, like names, predicates, and conjunctions, is that once the lat- 
ter are connected with a standard use their semantic value is determined, whereas 
the standard use only determines the rules of reference of the former These rules 
set conditions under which the particular token picks out a particular object. 
(Similar views can be found in D. Kaplan, “Words,” Proceedings of the Aristotelian 
Society, LXIV (1990): 92-119.) 

Statements about types can be understood as follows: to say that a type is used 
is to say that a token with certain characteristics 1s used. To say something about a 
type is to say something about tokens that have or might have had certain charac- 
teristics. We can identity an uninterpreted expression with a set of physical char- 
acteristics and a basic interpreted expression with a network of interpreted 
tokens. 
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To understand what the various views involve, it is helpful to con- 
sider the grammar of a small language containing quotation. The 
grammar that would result if the name theory were adopted is given 
below. 


Lexicon Phrase Structure 
NPs PRED ADJ S—NP is PRED 


Otto mentions Otto big mentions bigness very PRED > ADJ PRED 
Otto mentions Otto 

big mentions big 

big big mentions hg big 

and soon... 

very mentions very 

very very mentions very very 

and soon. . 

very big mentions very big 

Otto ts big mentions Otto ts big 

Otto is very big mentions Otto is very big 

Otto 1s very very big mentions Otto ts very very big 


and soon... 
CHART C 


Expressions in italics stand for expressions. Expressions in bold face 
stand for nonlinguistic entities. Quotation takes place when an ex- 
pression mentions an expression. On the name theory, every un- 
quoted expression, whether complex or simple, whether grammati- 
cal or ungrammatical, has as a counterpart a quoted expreasion that 
is a distinct semantically simple element. Every one of the infinity of 
quoted expressions is part of the lexicon. The grammar of the lexi- 
cal description theory is the same except that only the quoted coun- 
terparts of unquoted words are part of the lexicon. Complex quoted 
expressions are built up from simple quoted parts. Complex quoted 
expressions will be well-formed NPs even if their unquoted counter- 
parts are ungrammatical strings of words. On the spelling theory, 
the only quoted expressions in the lexicon are quotations of letters. 
The identity theory, in contrast, does not put any quoted expres- 
sions into the lexicon. The lexicon of the language above would 
contain ‘Otto’, ‘big’, and ‘very’, and a rule of interpretation would 
be added: quote (a) mentions a, for all a. 

When the theories are presented in this way, it is easy to see the 
shortcomings of the name and description theories. Quoted expres- 
sions in chart C receive their values on an individual basis with no 
eye to regularity. This way of proceeding does not guarantee that 
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quoted expressions match up with their values in any particular way. 
Whatever correlations arise are the result of accident, not planning. 
So, when we look at the grammar, we have to think that it is just a 
coincidence that ‘very’ mentions very, ‘Otto is big’ mentions ‘Otto is 
big’, and in general, expressions mention themselves. It would not 
go against the semantic rules if (20) were to turn out to be false 
because ‘Otto’ mentioned ‘supercalifragilisticexpialidocious'’: 


20. ‘Otto’ is big. 


This shows that the grammars of the name and description theories 
do not capture the phenomenon of quotation. It is of the essence of 
quotation that quoted expressions mention themselves and this es- 
sence is not an essential part of these theories.” 

The productivity of quotation also poses insurmountable prob- 
lems for the name and description theories. The name theory treats 
each quoted expression as a special case requiring individual stipula- 
tion. Since only a fmite number of stipulated expressions can occur 
in sentences of the language, the name theory predicts that only a 
finite number of expressions can occur in quotation. Previous ob- 
servations have already shown that this is not the case. I noted that 
an infinite number of new expressions may be defined using quota- 
tion and these expressions will be part of quoting sentences.”* In- 
deed, there is no need to go beyond expressions that already have 
meaning. There are an infinite number of meaningful expressions in 
English and everyone of them can be quoted. If the name theory 
were correct, this would not be possible. 

The productive resources of the description theories put them in 
a somewhat better position to confront the productivity of quota- 
tion. By choosing one simple quoted expression for each simple 
unquoted expression, the theories generate an infinite number of 
quoted expressions that mirror the infinite number of unquoted 
expressions. The problem with these theories is that not all expres- 
sions can be built out of the simples currently in the language. For 
instance, all potential new expressions cannot be built out of a finite 
number of parts. Since descriptions must be constructed from a 
finite basis and any expression may occur in quotation, there will 
not be enough descriptions to describe all expressions that may oc- 


1 This objection to the name and ion theories is m the spirit of those 
offered by Davidson; see op. at, pp. 83, 8 

* This is essentially the line taken by the argument in section II That argu- 
ment treated the occurrence of new expressions as the standard occurrence of 
simple expressions 
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cur in quotation. For this reason, the descriptions theories cannot 
be correct as general accounts of quotation.”’ 

The third and last aspect of quotation that I shall consider is its 
opacity. The opacity of the quotation is illustrated by the fact that 
substitution of ‘Clemens’ for ‘Twain’, which in other cases would be 
licensed by (23), leads from the true (22) to the false (24): 


22. “Twain was a writer’ has 15 letters. 
24. Twain = Clemens 

24. ‘Clemens was a writer’ has 15 letters. 
25. Twain was a writer 

26. Clemens was a writer. 

27. Clemens 

28 Twain 


The truth of (22) and the falsehood of (24) shows that the names 
‘Twain’ and ‘Clemens’ cannot be used interchangably in quotation. 
The name and description theories explain this fact in two stages. 
First, they determine the semantic structure of the quoted expres- 
sion, and then assign semantic values to the simple parts. The name 
theory treats the entire quoted expressions in (22) and (24) as simple 
wholes. As simples, it assigns to them the expressions in (25) and 
(26) as semantic values. The difference in the lengths of the seman- 
tic values of the quoted expressions explains the difference in the 
truth values of (22) and (24). The fact that the unquoted sentences 
have the same semantic value is irrelevant. The lexical description 
theory assigns the quoted expressions a structure in which the words 
are semantically simple. It assigns the expressions in (27) and (28) to 
the quoted names ‘Twain’ and ‘Clemens’. The difference in the 
lengths of the semantic values of the quoted names makes a differ- 
ence in the length of the semantic value of the quoted sentences, 
which explains the difference in truth value between (22) and (24). 
The fact that the unquoted names have the same semantic value is 
irrelevant. Similar things are true of the spelling theory. 

One should not be impressed by the apparent success of these 
theories in explaining opacity. Their emptiness is obvious. The 
weakness is again due to the stipulative nature of the semantics. In 
each case, the difference in truth value comes down to the differ- 
ence in the semantic value of the basic expressions, values which are 
perfectly arbitrary. Without some principle backing the assignment 
of values, the explanation of the difference in the truth values is 
simply that quoted sentences mention different things. But this 
much we knew before we got started. 


7 These objections are again essentially Davidson's; see op. cit., pp. 83, 89 
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The identity theory furnishes simple and natural explanations for 
the facts that cause trouble for the name and description theories. 
First, consider the fact that a quoted expression invariably mentions 
itself. This correlation appeared on the name and description the- 
ories to be merely a fortuitous coincidence. According to the iden- 
tity theory, it is the rule of quotation for which there are no excep- 
tions. If an expression does not mention itself, then it is not quoted. 
Second, the fact that we can quote an infinite number of meaning- 
less expressions, which turns out to be incomprehensible on the 
name and description theories, is not an issue for the identity 
theory. Using a single principle, the identity theory churns out in- 
terpretations of all quotations. The principle applies with equal in- 
difference to interpreted and uninterpreted expressions, and ap- 
plies to a finite number just as quickly as it does to an infinite 
number. As a result, problems having to do with the size of the 
interpretative apparatus do not afflict the identity theory. Finally, 
there is the phenomenon of opacity. Using the distinction between 
quotational and standard uses of expressions, we can say that the 
use of an expression is opaque if its standard semantic value may 
change when one of its constituents is replaced by another expres- 
sion with the same standard semantic value. By tying the semantic 
value of a quoted expression to the identity of the expression, the 
identity theory makes it clear why uses of expressions in quotation 
are opaque. In the case discussed above, replacing ‘Twain’ by ‘Clem- 
ens’ in (22) lengthens the expressions used and so lengthens what is 
mentioned, making (24) false. The fact that ‘Twain’ and ‘Clemens’ 
have the same standard semantic value is irrelevant. 

Although Davidson’s demonstrative theory is false, I believe that 
quotation has more in common with demonstration than with nam- 
ing or describing. We can demonstrate an object through one of its 
tokens. I can mention a type of car by pointing to a particular car 
and saying: ‘that model” or “the model of that car.” In demonstra- 
tion, the only requirement is that the object or its proxy be salient in 
the context. In quotation, we also mention an object through one of 
its tokens. The difference between quotation and demonstration is 
that quotation makes a more economical use of resources. Since 
what is mentioned is a piece of language, we economize and use it in 
a syntactic and semantic capacity in the sentence. 

COREY WASHINGTON 


University of Washington 
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NOTES AND NEWS 


The Association for Practical and Professional Ethics announces its second 
annual meeting, to be held March 4-6, 1993, at the University of Maryland/ 
College Park. The program will include: a keynote address by Sissela Bok; 
papers and presentations on ethical concerns in various fields; papers and 
presentations on ethical issues that cut across disciplines; demonstrations 
on the pedagogy of teaching ethics. For more information, contact Brian 
Schrag, Executive Secretary, Association for Practical and Professional Eth- 
ics, 410 North Park Avenue, Bloomington, IN 47405. 


Jones and Bartlett Publishers is pleased to announce that it has activated its 
textbook publishing program in philosophy, under the direction of Arthur 
C. Bartlett, and that it has acquired the distinguished philosophy list of 
Freeman, Cooper, and Company. Under the Consulting Editorship of Rob- 
ert Ginsberg of Penn State University, Jones and Bartlett is actively seeking 
philosophy texts in traditional fields, as well as in Feminist Theory, Applied 
Ethics, African Thought, Philosophy of Aging, Health Care, etc. Requests 
for examination copies or proposals for new volumes may be addressed to 
Arthur C. Bartlett, Jones and Bartlett Publishers, Inc., 30 Granada Court, 
Portola Valley, CA 94028-7730. 


Rodopi Publications is pleased to announce a program of scholarly works in 
philosophy, the Value Inquiry Book Senes, under the Executive Editorship 
of Robert Ginsberg, Penn State University. Monographs, thematic volumes, 
special series, conference proceedings, and translations into English will be 
considered in all areas of value inquiry. For further information, contact 
Robert Ginsberg, Executive Editor, Value Inquiry Book Series, Penn State 
Univ., Delaware County Campus, Media, PA 19063-5596 


Routledge is pleased to announce the publication of The International Jour- 
nal of Philosophical Studies (formerly Philosophical Studies, Ireland) begin- 
ning with Volume I, Number 1, March 1993. The Editor is Dermot Moran, 
and the Reviews Editor is Maria Baghramian. The aim of the journal is to 
publish articles of the highest academic quality in all areas of philosophy. In 
particular, it seeks to promote dialogue between “analytic” and “‘continen- 
tal” styles of philosophy, as well as to understand how issues in the history 
of philosophy shed light on current debates. There will be a comprehensive 
Book Review section, as well as a Critical Notice section where a major book 
(or theme bringing together several books) will be examined in greater 
depth. For information on submissions, write to Dermot Moran, Editor, 
The International Journal of Philosophy, Phil. Dept., University College 
Dublin, Dublin 4, Ireland, on book reviews and critical notices, to Maria 
Baghramian (also University College Dublin); on subscriptions, to Trevina 
White, Routledge Subscriptions, ITPS Ltd., Dept. J, Cheriton House, 
North Way, Andover, Hants SP10 5BE, England. 
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KNOWLEDGE OF THE PAST AND FUTURE* 


ost of the knowledge to which scientists aspire is of events 
not taking place at present. We construe the world as 


being composed of events that stretch from the distant 
past, as much as twenty billion years ago, to a time that is perhaps 
much further in the future. Our knowledge of individual events that 
are observationally remote from us is of a different character than is 
our knowledge of readily observed events. We cannot hope to learn 
about most remote events by simple observation. Instead, we de- 
pend on the processes of prediction—for the future, and retrodic- 
tion—for the past, in addition to observation, our most reliable 
source of information. Since prediction and retrodiction are funda- 
mentally theoretically based processes, the ultimate scope of predic- 
tive and retrodictive knowledge depends on basic physical theory 
much more than does observational knowledge. As a result, our 
possibilities for knowledge of the past and future have been in- 
fluenced by modern developments in physical science. We discuss 
those developments and their implications for prediction and retro- 
diction here." 
Prediction and retrodiction are often taken to be symmetric pro- 
cesses, and indeed we believe that they are in certain circumstances, 
such as those arising in celestial mechanics. Since in other circum- 


* The Editors regret that Professor Femberg died shortly after this jointly wnt- 
ten article was accepted. Please see De eg 1992 issue for an obituary 

1Tt ought to be remarked that we be discussing this largely from the 
perspective of what have become the standard interpretations of those develop- 
ments; and (more particularly) that we shall be discussing the implications of 
quantum mechanics for prediction and retrodiction largely from the perspective 
of the Copenhagen interpretation. 

One of us (D.A ) has wnitten elsewhere on these questions from very different 
perspectives. 


0022-862X/92/8912/607-42 © 1992 The Journal of Philosophy, Inc. 
607 ~ 


608 THE JOURNAL OF PHILOSOPHY 


stances retrodiction involves additional complications, however, we 
begin with the more familiar case of prediction. 
PREDICTION 

The ability to predict future events has been one of the hallmarks of 
science ever since Thales predicted a solar eclipse in the sixth cen- 
tury B.C. Progress in astronomy and mechanics, through the work 
of Kepler, Newton, Laplace, and Einstein has been tied to ever more 
accurate predictions of the positions of the sun, moon, and planets. 
By the nineteenth century, it had become possible, through the use 
of sophisticated mathematical analysis applied to Newton’s laws of 
motion, to predict the positions of the bodies in the solar system for 
hundreds of years into the future. In the twentieth century, scien- 
tists have made extremely accurate predictions of future motion, 
both of objects within the solar system and of certain extrasolar 
double-star systems, and have used these predictions to test some of 
the consequences of general relativity theory. 

The predictions made in Newtonian mechanics are of the motion 
of widely separated compact objects, or “particles.” While analo- 
gous predictions can be made for systems that do not consist of 
particles, such as electromagnetic fields as described by Maxwell’s 
equations, the main features of the standard conceptions of predic- 
tion are already exemplified in the Newtonian case, so we shall con- 
centrate on that at first. 

I. NEWTONIAN MECHANICS 
Suppose first that we wish to be able to predict future positions of a 
collection of particles using information that we can obtain now. 
The problem has three aspects: what is in principle predictable from 
the present physical state according to Newtonian mechanics, which 
of the properties of the present physical state which are necessary 
for such predictions are in principle discoverable according to New- 
tonian mechanics, and what additional limitations are imposed by 
the fact that we are finite creatures of bounded capacity. The third 
aspect is less fundamental than the other two, but at least the follow- 
ing three limitations that follow from it are general enough to be 
worth considering: we cannot measure continuously variable physi- 
cal quantities with perfect accuracy; we cannot make use of infinitely 
much information or carry out infinitely long computations; and we 
must gather all of our information to one location to make use of it. 

Newton’s laws mathematically determine the future positions of a 
collection of interacting particles in terms of present properties of 
the system, provided that the forces acting on the particles are 
known at all intervening times. What the forces are depends on 
aspects of physics which go beyond Newtonian mechanics. There are 
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a variety of circumstances, however, in which we are confident that 
we know the forces that are acting. In the case of planetary motion, 
for example, these forces are to a high degree of accuracy nothing 
but the mutual gravitational attractions between the sun, the plan- 
ets, and the other bodies in the solar system. Other forces, such as 
the pressure exerted by the “solar wind,” can easily be taken into 
account as small corrections. The problem of knowing what forces 
are acting in the general case is not so much a problem of prediction 
as a problem of the scope of physical knowledge and should be 
treated in this wider context. We discuss this problem briefly below. 

The properties of a system at the present time which are required 
for prediction are usually referred to as the initial conditions. For 
the solar system, insofar as the planets can be treated as mass points, 
the initial conditions amount to the position and velocity of each 
planet at one specific time, or something mathematically equivalent 
to this. Because the planets move in three-dimensional space, this 
means that at one time six numbers must be known for each planet. 
Armed with this knowledge, it then becomes possible to determine 
the future positions of each of the planets by applying Newton’s 
laws of motion. 

The above type of analysis led to one of the great triumphs of 
Newtonian physics. In the nineteenth century, the procedure was 
applied by Leverrier and Adams to the then recently discovered 
planet Uranus. They found that the procedure did not correctly 
describe the motion of this planet and drew the conclusion that an 
unknown force must be acting on Uranus to cause the deviation. 
They attributed this force to a yet unknown planet, and by a further 
use of Newton’s laws were able to infer the position of the new 
planet, which was soon observed and named Neptune. Thus the 
failure of one type of prediction led to a successful case of predic- 
tion on another level. 

Later in the nineteenth century, a similar effort to give a precise 
analysis of the motion of the planet Mercury using the influence of 
the known bodies in the solar system also showed a deviation from 
the Newtonian prediction. Attempts to repeat the successful discov- 
ery of Neptune by inferring a new planet, Vulcan, failed in this case, 
and the explanation of the discrepancy came in the twentieth cen- 
tury through the discovery of what is in effect an additional non- 
Newtonian force on the planets, whose existence is implied by the 
general theory of relativity. 

Il THE APOTHEOSIS OF DETERMINISM 
Even before the application of the Newtonian method to predict the 
existence of Neptune, the mathematician Laplace, one of the | 
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leaders in applying Newton’s laws to planetary motion, used New- 
ton’s ideas to draw a sweeping conclusion about scientific knowl- 
edge. Laplace imagined the universe as being composed of an im- 
mense number of particles, with known forces acting between them. 
He then described the view of such a universe which might be taken 
by a demonic observer who was ingenious enough to be able to 
determine the exact positions and velocities of all the particles at 
one time. For such an observer, the future motion of all the parti- 
cles would be exactly predictable. With the further assumption that 
more complex natural phenomena could be reduced to the motion 
of particles, it would then follow that Laplace’s demon could predict 
the complete future behavior of the entire universe.” 

`” Laplace’s vision is the apotheosis of predictive determinism. A 
physical theory is deterministic in the sense of Laplace if the future 
values of those quantities whose behavior the theory describes can 
be precisely predicted on the basis of some present values. In New- 
tonian mechanics, the quantities whose present values are needed 
are the positions and velocities of particles. Six numerical values 
must be known for each particle, so that, if the number of particles 
is finite, then precise knowledge of the present values of a finite 
number of quantities allows the prediction of the whole future. 
Usually, the initial conditions are taken to be the values of coordi- 
nates and velocities of all particles at one time. These are not the 
only values, however, that can be used to fix the solution to the 
Newtonian equations of motion. It is possible, for example, to spec- 
ify the positions of all objects at two different times instead, or some 
combination of positions and velocities at different times. Although 
there are special circumstances in which these alternatives will not 
work, such as when the motion is highly periodic, there are also 
cases in which one or another is much more convenient than the 
standard initial conditions, because it is difficult to carry out suff- 
cient measurements at a single time. 

Other deterministic theories require the measurement of an infi- 
nite number of quantities at one time in order to predict the future. 
In Maxwell’s theory of the electromagnetic field, for example, it is 
necessary to know the field at every point in space at one time, as 
well as the position and velocity at this time of all the charged parti- 


* The recent very careful treatment of issues connected with the topics of deter- 
minism in Tal and Laplacian determmism in particular by John Earman (A 
Primer on Porania (Dordrecht: Reidel, 1986)) should be consulted for fur- 
ther information on these topics. The problems raised there for Laplacian deter- 
minism overlap to some extent with those to be raised here, but Earman raises 
other serious objections there which are not directly relevant to the present dis- 
cussion 
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cles that affect the field, in order to predict their future values. 
Nevertheless, if all of these are known, then the future behavior of 
the fields and particles can be exactly predicted. 

Another type of prediction is possible in Newtonian mechanics 
which differs in important ways from Laplacian prediction.° Sup- 
pose that we are told that a measurement will be made of the nu- 
merical value of a specific physical quantity, such as one component 
of the position of a particle, at a definite time, t, in the future. 
Assuming that the forces acting on this particle are known, it is 
possible to predict the result of this measurement by carrying out a 
single measurement at present instead of six measurements for each 
particle as would be required for Laplacian prediction. What must 
be measured in order to predict the future result is the present 
value of a specific function of position and velocity, whose form 
depends on the forces acting and upon the time at which the future 
measurement will be made. If no forces act on the particle up until 
the time of the measurement, then the function that must be meas- 
ured at present is just a linear combination of position and velocity. 
It is given in this case by X = x + pt/m. Otherwise, the function is 
more complicated, although it can in principle be calculated by solv- 
ing the equations of motion. 

On the one hand, this type of prediction, which we have called 
single-measurement prediction, is more economical than Laplacian 
prediction since it requires less present input. On the other hand, it 
is less powerful in some respects. The present measurement will in 
general be useless for predicting the results of position measure- 
ments made at times other than t, although it does allow the predic- 
tion of a specific combination of position and velocity which will 
vary from one time to another. For Laplacian prediction, one set of 
present quantities suffices to determine the results of any future 
measurement whatsoever. This difference can be described by the 
statement that Laplacian prediction allows us to predict everything 
about the future, while single-measurement prediction allows us to 
predict any specific thing that we choose. We shall see that another 
important difference between Laplacian and single-measurement 
prediction is that single-measurement prediction can be taken over 
with no change to quantum mechanics, while Laplacian prediction 
cannot. 

II PROBLEMS FOR LAPLACIAN DETERMINISM 
The logic of Laplacian determinism implies that, in order to predict 
the future, knowledge of certain quantities is necessary and suffi- 


3 Feinberg, Albert, and Lavine, Physics Letters A, CXXXVIII (1989): 454-8. 
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cient. Precise prediction requires that the initial conditions be 
known exactly for all of the objects that influence one another. 
Furthermore, all of the forces that can act over the time period for 
which the predictions are to be made must also be exactly known. In 
the time since Laplace expressed his vision of a deterministic uni- 
verse, it has gradually been recognized that serious problems exist 
with each of these requirements for prediction. 


III.1. Sensitivity of predictions to initial conditions. There will always 
be errors in the measurements of physical quantities which make up 
initial conditions. These errors derive from several sources. One is 
the use of instruments that can in principle measure only with lim- 
ited accuracy. Such errors can in principle be made smaller by the 
use of higher precision instruments. Although it seems clear that no 
continuously variable quantity can be measured with perfect preci- 
sion, it is by no means clear a priori just how precisely such a quan- 
tity actually can be measured, especially in Newtonian mechanics, 
which does not include a theory of the structure of matter. 

Another source of measurement error is the limited availability of 
some physical quantity, such as energy, that is needed to carry out 
certain measurements. According to quantum mechanics, for exam- 
ple, the measurement of physical structure on very small scales re- 
quires the use of very large energies. This limits the accuracy of such 
measurements because of the limited amount of energy in the actual 
universe. 

It should be also be pointed out that the specification of even a 
single physical quantity to arbitrary precision requires giving a real 
number, which contains an infinite amount of information, i.e., all 
of its digits. A more prudent approach to what must be specified in 
order to make predictions would be to find out how much can be 
predicted on the basis of a finite amount of information, such as the 
first N digits of the initial position and velocity of an object. 

These considerations imply that the actual value of a physical 
quantity may fall anywhere within a range of values around the at- 
tributed value. If a small imprecision in our knowledge of the value 
of the initial conditions leads to only a small imprecision in the 
prediction of some future value of, for example, a position, then the 
prediction is said to be insensitive to initial conditions. If a small 
present imprecision can lead to a large future one, however, the 
prediction is said to be sensitive to initial conditions. Sensitivity to 
initial conditions therefore means that a small uncertainty in an 
initial value may, after a time, lead to a large uncertainty in the 
predicted value. The influence on prediction of imprecision in our 
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knowledge of the initial conditions generally increases with time. 
That is, for predictions over short time periods, a small range of 
initial values will always lead to a small range of predicted values; but 
as the time span of prediction grows, the uncertainty of prediction 
for a fixed range of initial values also tends to grow, and eventually 
becomes large enough to render the prediction useless. 

An important qualification to Laplacian determinism, first real- 
ized in the last century, is that in some cases this uncertainty of 
prediction increases very rapidly with time, so that even an ex- 
tremely small initial uncertainty rapidly results in a complete blur- 
ring of predictions. This extreme sensitivity to initial conditions 
does not happen for all Newtonian systems, but it is sufficiently 
common to raise serious questions about the usefulness of the La- 
placian program even within the confines of Newtonian mechanics. 
If we can predict accurately for only a short time into the future 
even if we make measurements that are as precise as we can reason- 

- ably expect, then for the system involved the times beyond this are 
as inaccessible to present predictions as if the theory were not deter- 
ministic at all. The existence of this problem suggests that the differ- 
ence for prediction between deterministic and nondeterministic the- 
ories may be much less sharp than one might imagine. That this 
problem exists for many Newtonian systems has been realized for a 
long time, even though it is periodically rediscovered and trium- 
phantly proclaimed anew. Poincaré, Born, and Schrödinger are but 
three of the scientists who have pointed out this aspect of Newton- 
ian mechanisms in the early twentieth century, long before its 
current reemergence as a part of the study of “chaos.” 

The fact that many Newtonian systems display extreme sensitivity 
to initial conditions appears to limit effectively what we can know at 
present about the future behavior of such systems. But before ac- 
cepting this conclusion, it is worth examining whether there are any 
alternatives. 

This problem with Laplacian prediction might be avoided if it 
were possible to eliminate the need for specifying initial conditions. 
The “‘initial” conditions themselves are just the values of some physi- 
cal quantities at a definite time. Evidently, according to Newtonian 
mechanics, it is possible to determine these quantities in terms of 
quantities at an earlier time. If everything in the universe always 
satisfied deterministic laws, then the behavior of everything at all 
times could, according to those laws, be referred back to conditions 


4 For a discussion of some related issues, see the article by J Ford m Physics 
Today, XXXVI, 4 (1983), p. 40. 
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that held at the origin of the universe. As Omar Khayyam wrote, 
“The first morning of creation wrote what the last dawn of reckon- 
ing shall read.” This does not yet eliminate the need for initial con- 
ditions. We can imagine, however, that conditions at the origin of 
the universe took a particularly simple form that was much more 
constrained than the “‘initial conditions” that are usually imposed at 
later times. If that were the case, then instead of needing to know to 
high precision the values of many independent quantities at the 

inning of the universe, we would only need to know a few. We 
could then calculate what happens afterward, à la Laplace’s demon. 
There have been some attempts in recent years to find simple initial 
conditions within the framework of a quantum mechanical descrip- 
tion of the beginning of the universe. 

Alternatively, it could be that the conditions that occurred at the 
beginning of the universe were such that its subsequent evolution 
was not especially sensitive to their precise values. Those systems 
which display extreme sensitivity to initial conditions are often such 
that this sensitivity only occurs in certain regions of the space of 
values of all initial conditions, and that in other regions the sensitiv- 
ity is much reduced. 

The idea that sensitivity to initial conditions has bad consequences 
for prediction is based on the notion that initial conditions are arbi- 
trarily variable, so that observation is the only way of learning about 
them. But it may be, for example, that for some physical systems, 
initial conditions are not arbitrarily variable. Simple qualitative fea- 
tures of the system, such as its past long-term stability, may require 
that the initial conditions that applied in the distant past belonged to 
some special set of values. Lf this were the case, then the prediction 
of the evolution of such systems might not be subject to the same 
restrictions as predictions for systems for which the initial condi- 
tions can vary arbitrarily. 

Another way of approaching predictions that are very sensitive to 
initial conditions is to ask what measurements are necessary in order 
to carry them out. To be specific, suppose we wish to predict the 
value of some physical quantity at a specific future time. Then we 
know from the discussion above that a single measurement at the 
present time is sufficient. A simple generalization of this technique 
of single-measurement prediction is that, if we wish to predict the 
values that N distinct quantities will have at various future times, 
which may or may not be the same, then it is always sufficient to 
measure N quantities at the present time. This type of prediction 
does not show extreme sensitivity to the measured values of these N 
quantities. 
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A much more economical approach, Laplacian prediction, is pos- 
sible for systems that permit predictions that are insensitive to initial 
conditions, an approach in which a fixed number of measurements 
allows for the accurate prediction of all future quantities. It is plau- 
sible to ask whether there is any intermediate situation between the 
“one-for-one” requirement of single-measurement prediction and 
the “‘one-for-all” requirement of Laplacian prediction. The answer 
to this question is not known to us. 

It is imaginable that phenomena overly sensitive to initial condi- 
tions will ultimately be described in some different way, one that 
does not involve initial conditions of the type usually assumed. 
There is no unique way to try to understand natural phenomena, 
and if one method fails, scientists are usually driven to devise others. 
For example, a description that uses a segment of a past trajectory 
might allow for accurate predictions even in situations where know- 
ing a single point on the trajectory does not. It is probably prema- 
ture to conclude that the extreme sensitivity of the behavior of some 
Newtonian systems to initial conditions entails insuperable obstacles 
to accurate prediction. 


II.2. Other problems for Laplacian prediction. 


III.2.A. Requirements beyond initial conditions. In addition to 
knowing the initial conditions, prediction in Newtonian mechanics 
also depends on knowing certain fixed parameters of a system, such 
as the masses of its component particles, and the constants, such as 
coefficients of friction, that enter into the forces that act among 
them. Learning the values of these parameters for a specific system 
involves carrying out measurements similar to those involved in 
learning initial conditions. Therefore, the knowledge of these param- 
eters is subject to the same sort of inaccuracies as is the knowledge 
of initial conditions. Uncertainties in masses and forces can show up 
as uncertainties in the prediction of the future behavior of a system. 
Consider, for example, a body moving under the influence of a 
constant electric force. The acceleration will depend on this force 
and on the mass of the body. If those are not known with certainty, 
then the acceleration will be uncertain. The mathematical expres- 
sion for the position of the body at some future time will include a 
term proportional to the acceleration, and the magnitude of this 
term will grow as the square of the time, so that after a long time has 
elapsed even a small uncertainty in the acceleration will lead to a 
large uncertainty in the position. 

This discussion of uncertainty assumes that the mathematical 
form of the forces that act is known, and that only some parameters 
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in it are uncertain.’ Actually, if one wants to predict the position of a 
body at some future time, T, it is necessary to know the forces acting 
on the body at all times between the present time and T. Strictly 
speaking, we cannot be sure what these forces will be from measure- 
ments that we can do at some fixed time earlier than T, unless there 
is some theoretical input. If we assume that the system satisfies the 
conservation of energy, then the force function at a specific point in 
space is time-independent. The force function will usually vary from 
point to point, however, so that as the body moves it will be subject 
to different forces whose values cannot be anticipated from the past 
history of the body. In order to anticipate what forces will act on the 
body, one could use some theory of the spatial variation of the 
force, such as that provided by Newton’s law of gravitation for plan- 
etary motion. Alternatively, one could try to measure the forces that 
would act on a body at any point in space. This is essentially what is 
done in mapping magnetic lines of force by the use of iron fillings. 
But this procedure involves going considerably beyond the 6N meas- 
urements that are usually described as necessary for prediction of 
the motion of N bodies in Newtonian mechanics., 


HI.2.B. Unobserved bodies. In Newtonian physics, the behavior of 
objects in some region of space can be significantly influenced by 
objects that are far from the region. Ordinarily, one imagines that 
far away objects have no important influences on a region unless 
they are extremely large. Even small, far-away objects, however, 
have some influence on what happens through the small, long-range 
forces that they exert. Gravity is one example of such a force. Over a 
period of time, such small influences can change the behavior of 
systems in the same sort of way as a small change in initial conditions 
would. This means that in order to make precise predictions about 
motion over long periods of time, it is necessary to have a complete 
inventory of all of the distant objects that might be influencing the 
motion. In the case of the future motion of planets in the solar 
system, for example, it would be necessary to include the effects of 
other stars, and ultimately of other galaxies, if the predictions are to 
be extended sufficiently far into the future. If some of these distant 
influences are omitted, then the predictions will become increas- 
ingly less accurate as time goes on. Ultimately, since every mass in 
the universe can influence every other one at least through gravity, a 
precise description of the future motion of any body would have to 
include the effects of all other bodies. 


* Ernest Nagel, for example, made a related point concerning mechanical 
nation See The Structures of Saence (New York: Harcourt, Brace, & World, 1961), 
pp. 166-74. 
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This generality about prediction needs to be fleshed out by a 
quantitative statement about how significant the influence of small, 
distant objects can be. To get some idea of the size of the effect, 
imagine that there is an unobserved planet the size of the earth at 
the distance of the nearest star. The gravitational effect of this 
planet on the motion of the earth would correspond to an additional 
acceleration of about 10718 cm/s®. Such an acceleration, if it main- 
tained a fixed direction, would produce a displacement of the earth 
of about 10!° cm over a period of ten billion years, the expected 
future lifetime of the earth. That distance is 100 times the size of the 
earth’s orbit. This suggests that, if we want ultimately to extend 
predictions of the earth’s motion to time intervals comparable to the 
future lifetime of the solar system, then we will need to take into 
account the gravitational effects of distant bodies. While this does 
not introduce new problems of principle, it does seriously compli- 
cate the practical problems of prediction, since one does not even 
know of the existence of all the distant bodies whose influence may 
be relevant. One might reverse the logic and argue that inexplicable 
deviations from motions predicted on the basis of known influences 
can be attributed to specific additional influences and used as 
constraints to help reveal the latter. The different fates of the 
nineteenth-century predictions of Neptune and Vulcan give some 
indication of the likely success of this approach. As we wish to pre- 
dict the motion of any object, such as the earth, ever more accu- 
rately over greater time periods, we shall necessarily have to include 
the influence of ever more distant objects. Our ability to make accu- 
rate predictions will rapidly come to be limited by this need to in- 
clude in our analysis increasing numbers of objects that are increas- 


ingly difficult to observe. 
IV. THE CONSEQUENCES OF RELATIVITY FOR PREDICTION 


In our discussion of the effects of distant bodies on local predic- 
tions, we have implicitly assumed that the distant bodies always re- 
main far away, and that their effects are felt instantaneously. The 
first of these assumptions is not always true, however, and the sec- 
ond is never true, as a result of special relativity theory. The special 
theory of relativity therefore implies novel limitations to prediction 
which, though they may not be as profound as those of quantum 
theory, nevertheless force us to modify substantially what we can 
expect to say accurately about the future. 


IV.1. Rapidly approaching bodies. Suppose an asteroid that is now 
very far away is heading on a course toward the earth which will lead 
to a collision in 100 years. While the asteroid may have only a very 
small effect on what happens on earth in the short run, it will likely 
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have an immense effect on what happens during and after the im- 
pact. If we knew that the asteroid was headed our way, then we 
could try to predict what its effect would be. That problem is not 
really different from predicting the effects of any complex future 
event, which may be done with more or less accuracy. If we do not 
know about the existence of the asteroid, however, then our predic- 
tions about what will happen on the earth after the next 100 years 
are likely to be seriously wrong. 

In a world correctly described by Newtonian physics, information 
could be obtained about all the bodies that in some future time 
period will pass sufficiently closely to a given body to influence it 
strongly. A proposal to obtain just such information about asteroids 
that might collide with the earth in the future has been considered 
seriously. If this were done, then we could avoid most surprises of 
the type just described. 

It is one of the consequences of the special theory of relativity 
that at any given time it is impossible to obtain complete informa- 
tion about what may collide with us in the future. Therefore, we can 
never be sure that a big surprise does not lie just ahead. In order to 
see why this is so, let us analyze how information about distant 
bodies is obtained. Such information can be obtained by one of two 
methods, passive or active. We could passively monitor all incoming 
radiation, such as light, radio waves, neutrinos, and so forth that 
may be reaching us from afar, and use the data to infer the future 
motion of the bodies emitting the radiation and, in particular, 
whether any of them will collide with us. This procedure amounts to 
what happens when a pedestrian looks at other pedestrians on a 
street in order to avoid colliding with them. Alternatively, we may 
actively emit radiation, checking whether it is reflected back by un- 
expected objects whose presence would then be revealed. The fu- 
ture motion and possible future collisions with us could be inferred 
as in the first method. This is what is done when a ship uses its radar 
to detect other ships in a dense fog. 

Both methods are effective for the low relative speeds of bodies 
on the earth, or for known bodies within the solar system. They 
become less effective, however, when the bodies involved are travel- 
ing at near the speed of light. Suppose, for example, that a previ- 
ously undetected high-temperature jet of gas is approaching the 
earth on a collision course at % the speed of light. At some instant 
of time, when the jet is one hundred ten million miles away (a dis- 
tance it takes light about 600 seconds to travel), we send out a radar 
pulse at the speed of light, which reflects from the object and re- 
turns to tell us of its existence and of the impending collision. The 
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pulse will return to us about 800 seconds later, only about 400 
seconds before the gas jet itself hits. Until the radar pulse returned, 
with its warning of the collision, any predictions that we would have 
made about events even a short time after the collision, now only 
400 seconds away, could not have been correct because of informa- 
tion that was not available to us. If the jet were traveling at very 
close to the speed of light, we could get almost no advanced warning 
about it, either from passive or active signals. If the approaching 
object is moving at the speed of light, as would be the case for the 
radiation from a nearby supernova, we can get no advanced warning 
at all. 

This problem occurs because according to special relativity we 
cannot use signals traveling faster than the speed of light to detect 
distant bodies. There is no such limitation in nonrelativistic physics, 
and the problem does not occur in this form. By using the active 
method with emitted radiation of arbitrarily high speed, we can al- 
ways get advanced warning of an approaching body, except for the 
limiting case of a body that travels toward us at infinite speed. Never- 
theless, if the body is approaching at some very large finite speed, 
the warning may give us only a very short notice of the impending 
collision. 

It may be imagined that an observer who has been receiving sig- 
nals for some time can do a better job of predicting the future. 
Suppose that there is some object, S, that has already been observed 
at a time before the present time, tọ. If these observations are sufh- 
ciently detailed, then they can be used to predict whether S will emit 
something subsequent to the latest time that.it has been observed, 
which will reach the observer just after t). If these prior observa- 
tions imply that it will not do so, then the observer has reason to 
believe that no surprise awaits. For example, to the extent that as- 
tronomers are sufficiently confident of their theory of how stars 
undergo the massive explosions known as supernovas, they could 
use observations about the radiation emitted by some star to infer 
that the star will not become a supernova a short time after the latest 
observation. In other words, for a body that we have observed for 
some time, analysis of its past behavior can be used to predict its 
future behavior and makes it possible to determine whether any part 
of the body will eventually collide with us. It is worth remarking that 
such analysis introduces significant new theoretical content into the 
bases of our predictions. 


IV.2. Horizons. Even if our theories were perfect, the above type of 
analysis is not sufficient to rule out all surprises. In universes with 
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certain types of cosmological structures,’ objects may exist which 
could never have been observed by someone at a point, P, until the 
time, tọ. These objects may nevertheless emit signals that will reach 
P just after tọ. If the universe were finite in extent, had existed 
forever, and was “flat” in the sense of general relativity, then no 
such objects could exist. Such objects could, however, exist in a 
universe whose age is finite. 

Suppose, for example, that the universe is static, has an age, A, 
and extends to a distance, D, that is considerably larger than cA, the 
distance that light could have traveled since the beginning of the 
universe. Then any objects whose distance apart is greater than cA 
will have been unobservable to each other until the present. There- 
fore, any mutual influence that such undetectable objects might 
have in the future will be unpredictable until it actually occurs. In 
the actual universe, which is expanding, the situation is still more 
complicated, but most of the cosmological solutions of general rela- 
tivity which have been studied as models for our own universe in- 
clude “horizons” of the type just discussed, that is, horizons beyond 
which distant objects will not become observable until some specific 
future time. In a universe that contains cosmological horizons, it 
would appear that we cannot eliminate possible surprises about 
what will happen locally when the influence of presently hidden 
distant objects begins to be felt. 

It should be noted that there may be indirect ways to detect ob- 
jects that are beyond such horizons. For the case that has been most 
completely analyzed, that of black hole horizons, some information 
about what is inside the horizon can be obtained. For example, the 
total energy and electric charge residing inside can be learned from 
measurements of the gravitational field and electric field outside the 
horizon. Also, it has been suggested that quantum effects, such as 
the deviation from a thermal spectrum of the “Hawking radiation” 
emitted by a black hole, give additional information about the mat- 
ter inside the horizon.’ It is possible that similar effects exist for 
cosmological horizons, but this question has not yet been investi- 
gated. It is therefore premature to draw conclusions about the im- 
plications of horizons for prediction. 


IV.3, Retarded interactions. There is a further problem in relativis- 
tic theories which occurs even for systems that we can assume iso- 
lated over the period for which we wish to make predictions. In 


° The various concepts of horizons in relativity theory have been discussed by 
W. Rindler, Royal Astronomical Society Monthly Notices, CXVI (1956), p. 663. 
1 See, e.g., T.D. Lee, Nuclear Physics B, CCLXIV (1985), p. 437. 
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general, in such theories, the forces that act at present on an object 
depend on the past and present positions and velocities of all other 
objects within the system which can influence its motion. The role of 
past positions and velocities, rather than just present ones, arises 
because of the finite speed of light, which implies that the forces 
that act are retarded, that is, they depend on where the objects were 
rather than where they are. This means that the equation of motion 
of each object is a complicated integro-differential equation, involv- 
ing integrals over past times, rather than a simple differential equa- 
tion of the Newtonian type. This change in the form of the equa- 
tions of motion leads to a change in the information that is neces- 
sary for prediction from that of nonrelativistic mechanics, in which 
forces act instantaneously across any distance.® 

In a simple case, that of two particles interacting through electro- 
magnetic forces, the force depends on the distance between the 
particles at a specific time in the past, known as the retarded time, 
which is a function of their present distance apart and the speed of 
light. This implies that, if we wish to predict the motion of one 
particle over some interval of time beginning with the present, then 
we must know the position of both particles from the retarded time 
to the present. That is immensely more information than is needed 
in nonrelativistic physics to carry out the same type of prediction. 
Whereas in nonrelativistic mechanics we need to know conditions at 
a single time, in relativistic physics one seems to need to know corre- 
sponding conditions over a continuous period of time. 

It might be thought that the positions over the interval between 
the retarded time and the present could themselves be calculated 
from a knowledge of the positions and velocities at the retarded 
time alone. This calculation, however, would require knowing the 
forces that acted between the particles during this earlier time inter- 
val, which in turn would require knowing the positions at some still 
earlier retarded-time interval, and we are led to a requirement of 
knowing positions and velocities at an infinite number of discrete 
past times in order to carry out this procedure. 

It is instead possible to adopt the approach of field theory, in 
which the forces that act on particles depend on the present values 
of the fields where the particles are. If the values of fields and their 
first time derivatives are known everywhere at the present time, as 
well as the present particle coordinates and velocities, then the joint 
evolution equations of the particles and the fields can be used to 

* For a discussion of some of the mathematical questions arising from equations 
with retarded interactions, see the article by R. D. Driver, Physical 
CLXXVII (1969), p. 2051 
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determine their future values. This procedure also requires knowing 
much more than is required in nonrelativistic particle mechanics, 
since the values of the fields at all points in space at some fixed time 
involve a continuum of quantities. In addition, because of the ef- 
fects of horizons discussed above, it is in general not possible to 
determine all of these field quantities by measurements made at a 
fixed time. It is not necessary to know the fields at all points in space 
in order to calculate the force on the particles at any one time. If we 
want to know the force on the particles over a period of future time, 
however, we must know the present values of the fields even at 
points where the particles will never be, because those field values 
will influence the fields at future times at points where the particles 
will be. Thus, all of these approaches to prediction imply that even 
in the simplest case of special relativistic mechanics we require much 
more input to make predictions than would be needed in nonrelativ- 
istic particle mechanics. Instead of getting a large amount of infor- 
mation about the future from a small amount of information about 
the present, we need a large amount of information in order to be 


able to predict anything beyond the present. 
V LIMITS TO PREDICTION IN QUANTUM MECHANICS 


One of the most surprising developments in twentieth-century phys- 
ics has been the recognition that the fundamental laws that describe 
microscopic objects, and in principle macroscopic ones as well, are 
such that precise prediction is in most instances impossible, even for 
systems that are as simple as any that we know. Consider as an 
example a single unstable subatomic particle, such as a muon. On 
the average, a newly produced muon will decay in approximately 
two microseconds into several other subatomic particles. The time at 
which a specific muon will decay is not predictable according to 
quantum mechanics, however, even though that time can be deter- 
mined once the decay has taken place. As quantum mechanics is 
usually presented, there are several distinct aspects to the limits on 
prediction. Some of these arise from one consequence of the formal- 
ism of quantum mechanics, which is that the states that are usually 
prepared, such as that of a single muon, do not evolve over time into 
states with definite values of the quantities that we would like to 
predict. We shall see, however, that the same formalism implies that 
for any quantity that we can measure in the future, there exist states 
which, if not disturbed by intervening measurements, do evolve into 
ones with a definite value of that quantity. In most cases, however, it 
18 not at all transparent how these states can be prepared, or what 
their physical interpretation is. 
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Prediction in quantum mechanics is also constrained by the meas- 
uring process, which can introduce uncontrollable changes in the 
state of a system. As a result, any predictions about a system at a 
future time, ty, which are based on measurements done at an earlier 
time, say fj, can be upset by measurements of incommensurable 
quantities that are carried out at an intervening time, ¢,, such that 
ty < h < tp. As a result, it is generally impossible to predict beyond 
the next measurement on a system. These different aspects of quan- 
tum imprecision must be carefully disentangled in order to appreci- 
ate properly what can and cannot be predicted. 


V.1. Unpredictabiltty due to Heisenberg’s principle. The most cele- 
brated limit to prediction in quantum mechanics is a consequence of 
Heisenberg’s uncertainty principle, which implies that Laplacian 
prediction is usually impossible in quantum mechanics. Recall that 
in Newtonian physics, predicting the position of a particle at an 
arbitrary future time requires the precise knowledge of the position 
and velocity at some earlier time, or some equivalent information. 
But Heisenberg’s principle entails that no experiment can reveal 
this information.’ Any measurements of position and velocity are 
constrained by the relation AxAv > h/2xm, where h is Planck's 
constant, and m is the particle mass. In this relation, Ax is the accu- 
racy with which the position is being measured and Av the accuracy 
with which the velocity is being measured. The reciprocal relation 
between these accuracies allows for approximate prediction of fu- 
ture positions, but not exact predictions. The inherent imprecision 
of predictions of position are relatively unimportant for macro- 
scopic objects, because of the factor of m in the denominator of 
Heisenberg’s relation. To see what quantitative limits on prediction 
follow from Heisenberg’s principle, consider a single particle mov- 
ing through otherwise empty space. If the initial conditions are de- 
termined with maximum accuracy at time 0, then at a later time, t, 
the minimum uncertainty in the position will be: 


AXE) = (ht/ xm) 


What this means is that the possible position of the particle at time t 
will be distributed about some average value, with a “width” given 
by Ax(t)um- Note that Ax(t),., increases indefinitely with time, so 
that eventually, unless some further measurement intervenes, the 


° These implications are worked out in some detail ın Werner Heisenberg’s The 
Physical Principles of the Quantum Theory (Chicago: University Press, 1930). 
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position is completely uncertain. For an electron, this width Ax(f) min 
corresponds to about 1 cm at a time of one second. In other 
words, the future position of an electron in empty space be- 
comes completely unknown in a relatively short time. For a virus 
(m = 1071 gm), the uncertainty width is about 10™ cm, or about 
one viral length, 300 seconds after the initial measurement. So to 
the extent that a virus could be treated as a quantum mechanical 
particle moving in empty space, its future position could not be 
predicted accurately even a few minutes into the future. 

Several comments need to be made about these results. First, that 
what holds for position does not necessarily hold for other physical 
properties. The momentum of an electron in empty space, once 
measured, is known at all future times, unless further measurements 
of quantities incommensurable with momentum are performed. 
This is because the momentum does not change in the undisturbed 
evolution of an electron in empty space. Indeed, for any conserved 
quantity, such as the total energy of an isolated system, a single 
measurement is sufficient to determine its values for all future 
times. Here the situation is the same as in Newtonian physics. Sec- 
ond, we have idealized the situation by assuming that the object 
whose future prediction we wish to predict is in empty space. The 
situation is somewhat different if it is acted on by forces, since those 
forces may tend to restrict its motion, including the “motion” that 
occurs as a result of uncertainty in the initial velocity. An example of 
this is given below. 

The situation in quantum mechanics is completely different with 
regard to single-measurement prediction of the next measurement 
to be made on a system (cf. fn.3). Such predictions can be made in 
precisely the same way as in Newtonian physics. This is because, 
according to quantum mechanics, the wave function of a particle 
evolves in a time-reversible manner. For example, if a free particle 
can be prepared at time zero to have the wave function which is the 
one which evolves from that of a particle with a definite position at 
time —i, except that the momentum is reversed, then at a time +t 
this particle will have a definite position, provided that there are no 
other measurements on the particle between time zero and +t. This 
will be true whatever the forces that act on the particle, although the 
actual wave function that must be prepared at time zero will depend 
on these forces. These results imply that there is a specific measure- 
ment, of a quantity formally analogous to that required for single- 
measurement prediction in Newtonian mechanics, which if carried 
out at t = 0 will allow the position of the particle at the. specific 
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future time +t to be predicted with certainty. Of course, just as in 
Newtonian mechanics, the measurement done at ¢ = 0 will not suf- 
fice for the prediction of the position at any time other than +t, 
although it will allow for the prediction of some function of position 
and momentum at various future times. Furthermore, the particle 
will in general never have a definite momentum, so that even in this 
case the prediction is much more limited than in Newtonian physics. 
Finally, for single-measurement prediction in quantum mechanics, 
there remains the question as to whether the function of position 
and momentum which must be measured at ¢ = 0 in order to predict 
the position of the particle at time +t actually can be measured. 
With these very important qualifications, quantum mechanics may 
also be thought of as a deterministic theory, in that by suitable 
measurements now, it is possible to predict with certainty the result 
of any specific measurement that is to be done at a predetermined 
time in the future. 

Another case in which exact prediction of future position is possi- 
ble is when certain forces act on a particle. For example, for a 
particle bound by a quadratic potential, the so-called “harmonic 
oscillator,” if it is prepared to have a definite position at time zero, 
then it will also have a definite position at any later time that is a 
multiple of the basic period, T, of the harmonic oscillator. Thus, if 
we measure the position as x = xy at ¢ = 0 and allow it to evolve by 
the equations of motion, we can safely predict that at the future 
time T, the position will again be xp. Again, this prediction is only 
possible at specific future times, and does not hold for momentum. 
Furthermore, the uncertainty in position will not increase indefi- 
nitely in time in this case, but instead oscillates between definite 
limits. 

These forms of quantum determinism do not appear to avoid the 
problem of limited knowledge of the future. Part of what we mean 
by unlimited knowledge of the future is to be able to anticipate what 
will actually happen. If we do not know now what measurements will 
actually be done some day, we are unable to use single-measurement 
prediction to anticipate their results. Moreover, in most cases sin- 
gle-measurement prediction only allows us to predict the result of 
the next measurement on a system, not those of subsequent 
members of a series of measurements. In the standard, collapse 
interpretation of quantum mechanics, no present measurements will 
allow us to predict precisely the result of future measurements that 
are separated from the present situation by other measurements of 
incommensurable quantities. 
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V.2. Unpredictability in particle decay. Another restriction concern- 
ing predictability contained in quantum mechanics is best typified by 
the decay of an unstable state. At first glance, this might also be 
thought of as due to a lack of knowledge of initial conditions. A 
uranium nucleus that decays by alpha-particle emission is a complex 
structure containing many neutrons and protons, and one could 
imagine that, if the precise motions of these particles were known, 
the time at which the nucleus will decay could be predicted. Of 
course, Heisenberg’s principle forbids us from doing this, but we 
might try to use single-measurement prediction, for example, to 
infer whether the particles that are now within the nucleus will be 
outside of it at some specific future time. While this can be done in 
principle, it does not really satisfy what we would ordinarily think of 
as a prediction about whether the nucleus will decay. The reason is 
that there is a high probability that the nucleus will be disintegrated 
simply as a result of the measurement needed to predict the future 
position of the particles that are now in the nucleus. While quantum 
mechanics does not forbid us from carrying out this measurement, it 
implies that the measurement will transform the bound collection of 
Particles into a new state that will not resemble the nucleus very 
much. It is this new state that will evolve in such a way that we can 
predict the future positions of the particles in it, not the original 
bound nucleus. Hence the prediction allowed by quantum mechan- 
ics is not the one that we would have liked to be able to make. 
There are other circumstances in which decays appear 'to be un- 
predictable in an even more fundamental way than for an unstable 
nucleus, where it seems to be the initial conditions that forbid the 
prediction. Some unstable systems are not complex in the sense that 
a nucleus is. A single subatomic particle, such as a muon, is as simple 
a system as is known to us. Nevertheless, its decay follows the same 
statistical law as does the “complex” alpha decay, and the decay of 
an individual muon appears to be as unpredictable as that of an 
individual uranium nucleus. The decay of such a particle is de- 
scribed by an extension of quantum theory to fields. The decay can 
be thought of as a transition between two different states of coupled 
quantum fields. This transition takes place according to the 
Schrödinger equation applied to the coupled fields. According to 
that equation, the state of the entire system is a superposition of two 
possibilities. In one of them, the system remains in its initial configu- 
ration, corresponding to the presence of the unstable particle. In 
the other, the system is in the configuration in which the unstable 
particle is gone, and its decay products are present. The relative 
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probability of these configurations changes with time. In the usual 
description, the probability of the first configuration starts out as 
one and predominates at early times, while that of the second starts 
at zero and predominates at later times. In that description, the 
ratio of probabilities changes smoothly with time. The probability 
that the system remains in the undecayed state decreases exponen- 
tially with time, and there is no sharp instant corresponding to the 
decay. It is therefore impossible to predict the precise time at which 
the transition takes place. Instead, if the system is monitored by 
some external measuring instrument, there is a time-varying proba- 
bility that this instrument will record that the decay has occurred. 
Typically, there is a substantial probability of decay over some range 
of times which is approximately equal to the mean lifetime of the 
decaying state. In other words, if a particle decays with a mean 
lifetime of two microseconds, then a given example of that particle 
has a reasonable probability of decay anywhere between about zero 
and four microseconds after it is produced. This situation is very 
similar to a measurement of position in a state that is a superposi- 
tion of states with definite position, where we can know in advance 
only the relative probability of different results. 

It might appear that there is a significant distinction between the 
two cases, because decays take place spontaneously while position 
measurements require outside intervention. Actually, the distinction 
between the two cases is not so absolute. The determination that a 
decay has taken place also requires a measurement, for example, 
one that detects the decay products. In ordinary circumstances, 
these measurements are made intermittently, on a time scale that is 
long in comparison with the internal dynamics of the decay process. 
In that case, we can think of the measurement as determining, at a 
series of times, whether the system has decayed, much as we might 
measure the position of a particle at a series of times. In between 
these times of measurement, the system evolves without reference to 
the measuring process. But just as for position measurement, the 
determination of whether the decay has taken place does affect the 
system, and its subsequent evolution occurs under a new set of ini- 
tial conditions. Nevertheless, the result of this process is the predic- 
tion that the probability that the initial system is intact decreases 
exponentially with the time that has elapsed from its original produc- 
tion. If the measuring process takes place on a time scale compara- 
ble to that of the dynamics of the decay process, it may result in the 
transfer of large enough amounts of energy to or from the decaying 
system to influence the decay process. In that case, there may not be 
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an exponential time dependence for the decay.!° But monitoring the 
system over such a short time scale is difficult to accomplish for 
subatomic particle decays. 

It is not only for decays that the problem of predicting when 

transitions take place exists, although they illustrate it most graphi- 
cally. Actually, any transition from one quantum mechanical state to 
another, whether it takes place spontaneously, as in a decay, or as 
the result of some outside perturbation, can be predicted only sta- 
tistically, except in very special circumstances, as when there is a 
very large outside perturbation. The reason for this is the law that 
governs how any system governed by quantum mechanics evolves in 
time. According to that law, if we express the initial state as a super- 
position of states with definite total energy, then each term in the 
superposition will change over time through being multiplied by a 
time-dependent complex number of magnitude one. This later su- 
perposition can be reexpressed in terms of whatever states corre- 
spond to the measuring process, such as an undecayed particle and 
its decay products. The coefficients in this expansion in terms of the 
states to be measured, which depend on time, then represent the 
probability of transitions. Usually, these coefficients are different 
from zero and one, which means that the probability of a specific 
transition is not zero or one, but some intermediate value. As a 
result, the transition does not take place with certainty at a specific 
time, but may take place at any time for which the expansion coeffi- 
cient does not vanish. The occurrence of a decay at a specific time is, 
according to this analysis, not very different from the actualization 
of a specific position for a particle as the result of a position meas- 
urement on a state’ that contains a superposition of different 
positions. 
V.3. The significance of unpredictability in quantum mechanics. It is 
undeniable that according to the existing form of quantum mechan- 
ics, future events cannot be precisely predicted. In the standard 
formulation, this is simply a brute fact about the world. According 
to this formulation, the notion of previous generations of physicists 
—that the future is predictable—was simply an overfacile general- 
ization, based on the approximate description of large-scale phe- 
nomena that we know as Newtonian physics. While we might wish 
that we lived in a predictable world, we have no choice but to accept 
nature as it is, and live with the consequences. 


10 A review of the literature on this effect can be found in Anthony Sudbery, 
Quantum Mechanics and the Particles of Nature (New York: Cambridge, 1986) 
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Alternative formulations of quantum mechanics, such as that due 
to DeBroglie and Bohm,’ do not change the conclusion that the 
future cannot be predicted, although they modify the reasons for it. 
In some of these theories, the measurement process does not itself 
lead to additional constraints on prediction, and all unpredictability 
is the result of insufficient knowledge of initial conditions. In some 
formulations, this lack of knowledge seems to be similar to that in 
classical statistical mechanics, in that it describes the world as being 
in one of a large number of similar configurations, together called 
an ensemble, in which initial conditions vary from one to another. If 
we could somehow determine which member of the ensemble we 
were in, we could then make exact predictions about the future. But 
these alternative formulations contain no clue as to how this might 
be done, even in principle. Indeed, it can be proven that in order to 
do so, something would have to be changed about the Schrödinger 
equation, which describes how the quantum mechanical wave func- 
tion changes with time. Theories that do this have occasionally been 
put forward, though not always in order to make the future predict- 
able. If such a change were made, we would be dealing with a new 
theory, rather than another formulation of the same theory. Presum- 
ably, this new theory would imply that there were some additional 
measurements which could be made which would enable us to make 
precise predictions about some future event, such as the decay of an 
individual subatomic particle. There is no clue in anything 
we now know about what these additional types of measurement 


could be. 
RETRODICTION 


A large number of the events that will ever happen have already 
happened and we cannot plan to observe these events as we can for 
future events. An important question for human knowledge con- 
cerns the extent to which we can recover information about past 
events on the basis of present and future measurements. 

While the limits for prediction which occur in Newtonian and 
quantum theory have been the subject of a great deal of analysis, 
there has been relatively little analysis of the corresponding limits 
for retrodiction. Such limits nevertheless exist, and while they are 
different in origin and scope from those for prediction, they are as 
important for an overall evaluation of what we ultimately can or 
cannot know. There are two types of retrodiction which need to be 
considered, which we will refer to as evolutionary and nonevolution- 


1 David Bohm, Physical Review, LXXXV (1952), pp. 166, 180 
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ary retrodiction. Evolutionary retrodiction is very similar to the pro- 
cess used for prediction, which we have discussed above. It can be of 
the Laplacian or of the single-measurement type. For simple me- 
chanical systems, such as the sun and planets taken as mass points, 
the same equations that determine future motions in terms of pres- 
ent conditions also can be integrated backward in time to give the 
past motions that led to the present conditions. Most of the consid- 
erations involved in such retrodiction in Newtonian physics are 
quite similar to those relevant to prediction, although this is not the 
case in quantum physics, to which we return below. 

A much greater difference between prediction and retrodiction, 
even in the nonquantum case, comes about, however, through the 
existence of alternative means to recover the past by the use of 
records. So far as we know, there is no analogue for knowing the 
future which corresponds to the use of records to know the past. 
Usually, one thinks of this difference as being a consequence of the 
fact that the past has already occurred while the future has not. We 
believe that instead of being a consequence of this asymmetry be- 
tween past and future, the existence of records defines the asym- 
metry. We shall refer to any procedure using such alternative meth- 
ods as nonevolutionary retrodiction. 

VI HOW CAN THE PAST BE RECOVERED? 
Some past events leave obvious records of themselves, as John Ken- 
nedy’s inaugural address did, both in the brains of those who heard 
it and on many television tapes. Other events seem to vanish without 
a trace. The glancing of a ray of sunshine off the surface of the 
Pacific Ocean at 9 A.M. on May 27 in the year 1,000,001,933 B.C. 
would seem to have left no record of itself which can be recovered 
by anything that we can do now or later. There are intermediate 
situations in which some event left a record when it occurred, but 
this record has not been preserved, so that it is not available for 
present and future knowledge. An example would be a footprint left 
by someone walking on a beach, which remained for a time before 
being erased by the incoming tide. This simple distinction between 
recorded and unrecorded events, however, masks some subtleties 
about the physical world which need to be explored before we can 
decide how much of the past can be recovered. Does any event really 
occur without leaving a recoverable record of itself? May it be that, 
if we were to examine the present with sufficient care, we could find 
in it records of all that has ever been? For example, consider as an 
event a person alone in a room speaking otherwise unheard words 
“to the four walls.” After these words are uttered, they become 
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patterns of sound, that is, ordered motions of the molecules of the 
air in the room. For a short time after the words are spoken, by 
using sensitive devices for measuring air pressures and other physi- 
cal quantities we could determine the precise pattern of sound 
which was emitted and so learn what was said. Indeed, the ear of 
another person, or the microphone of a tape recorder do just that. 
So, for this short time, a trace does exist of the words. But now 
imagine waiting for a while, say five minutes, so that the sound of 
the words is ostensibly no longer present, and a listener or a micro- 
phone will not be able to pick up the words. What has happened? 
The sound waves have interacted with objects in the room and with 
the walls. This interaction has changed the character of the waves so 
that there is no longer any very orderly motion of the air molecules 
related to the words originally uttered. Their trace seems to have 
vanished. Yet this cannot be the whole story. The phenomenon of 
reverberation shows that in a suitably prepared enclosure the trace 
of a voice or other sound can last much longer than usual and still 
be heard or otherwise recorded. 

Even if the room does not have special properties, there is reason 
to think that some trace remains of any sound produced in it. To see 
what leads us to this conclusion, let us follow the same type of 
analysis for retrodiction which we have done above for prediction. 
In a completely deterministic world, such as one governed by New- 
tonian mechanics, the present state of the objects in the room is 
precisely correlated with their state at any instant in the past. (We 
assume here that the room is isolated from all outside influences, an 
assumption which may well be invalid in practice. We return to this 
issue below.) For such a world, just as we can predict the future with 
the aid of sufficiently exact present measurements, so we can retro- 
dict the past with the aid of these same measurements. In this world, 
by making sufficiently accurate measurements on the objects in the 
room and using the laws of evolution, we could learn what was said 
in the room five minutes or even ten years ago. We have called this 
process evolutionary retrodiction. 

Now, imagine the following sequence of events. An event occurs 
somewhere. For a short time thereafter, the consequences of the 
event are well-correlated with the event and only a small number of 
physical objects have been influenced by it. Measurements on this 
small set of objects are sufficient to carry out evolutionary retrodic- 
tion. After some time has elapsed, however, the effect of the event 
will become more diffuse and involve a large number of objects. In 
this case, evolutionary retrodiction must be applied to a much wider 


632 THE JOURNAL OF PHILOSOPHY 


range of physical objects in order that an analysis of this process can 
lead to an inference of what happened. Thus, evolutionary retrodic- 
tion becomes more difficult as more time elapses after the event 
took place, because a wider range of objects becomes involved in the 
causal chain that begins with a specific event. A related reason why 
the traces of past events become increasingly difficult to detect as 
time passes is that most systems are open rather than closed, that is, 
they interact with the external world. As a result, as more time 
elapses, the effect of any past event is diffused over a wider and 
wider spatial region. In order to specify an event completely by 
measuring its later effects, it therefore becomes necessary to ob- 
serve this wider region. The moving air molecules produced when a 
word is spoken collide with other molecules, which in their turn 
collide with still others, somewhat like the child’s toy consisting of a 
number of steel balls suspended from a common support. In order 
to recapture the original sound impulse after some time has passed, 
it is necessary to observe most or all of the molecules that have been 
struck by the originally moving group, or by any that have been set 
into motion by them. If the room in which the sound is uttered is 
not very carefully insulated from the outside, this set of molecules 


that must be observed rapidly becomes immense. 
VII THERMODYNAMIC IRREVERSIBILITY AND RETRODICTION 


As indicated above, for the world as described by Newtonian me- 
chanics, it would always be possible in principle for a closed system 
to recover the information about past events by careful measure- 
ments of the present state of things, combined with a use of the 
equations of evolution. By knowing in sufficient detail how things 
are now, we could not only determine what the future will surely be, 
but also what the past must have been. One major reason why such 
evolutionary retrodiction is not generally applicable to our universe 
is the phenomenon known as thermodynamic irreversibility. This is a 
fancy name for something very familiar. Imagine two bodies of 
water, one hot and one cold. When the two are mixed, after a short 
time there results a single body of water at some intermediate tem- 
perature. The mixing process has resulted in what physicists call an 
irreversible transition. Despite the name, this does not mean that 
the effect of mixing cannot be undone. Obviously, by dividing the 
mixture in two and applying heat or other forms of energy to the 
parts, it is possible to reproduce the two original bodies of water at 
their different temperatures. What the word ‘irreversible’ does 
mean is that this reversal will not occur spontaneously in an isolated 
system without the need for an external energy source, in the way 
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that the original change to a uniform temperature takes place. Most 
changes that occur in nature are irreversible in this sense. The signif- 
icance of irreversible transitions for retrodiction is that usually when 
an irreversible change takes place, the resultant system cannot be 
correlated to a unique predecessor. A single mixed body of water at 
a uniform temperature of thirty degrees Centrigrade might have 
come about by mixing two bodies of water whose temperatures were 
twenty and forty, or ten and fifty; or an infinite number of other 
possibilities. Once the mixed water has settled down to what is called 
equilibrium, it is impossible just by inspection of it to determine 
what bodies were combined to produce it. Even though the com- 
bined system can be returned to a situation in which its two parts are 
at different temperatures, there is no unique way of doing this; and 
if we do not know its history, we cannot expect to reproduce its 
actual past even by applying an external energy source to it. For a 
reversible change, on the other hand, it is simple to find out in what 
state the system undergoing the change originated. It is only neces- 
sary to let the change reverse itself, or to return the environment to 
its original state, and the state of the system which is reached when 
the environment is back in its original state is also the one in which 
the system started. So for this case at least there is a direct correla- 
tion between irreversibility and loss of information about the past 
state of a system. More generally, whenever irreversible processes 
occur, there is a loss of information about what existed previously. 
It is this loss of information that is responsible in many cases for the 
rapid decrease in our capability to retrodict about real physical sys- 
tems. The orderly motion of air molecules produced by spoken 
words in the environment of the room rapidly becomes converted 
into disorderly motions of the type we call heat. Unless some kind of 
recording device is present which retains a trace of the orderly mo- 
lecular motions, it seems impossible to recover the sounds even in 
an isolated room. This is the conventional wisdom about irreversible 
changes, but is it really so? It is possible to find examples of systems : 
that undergo changes which seem very much like the irreversible 
processes that have been discussed, which nevertheless can be 
treated so that the original situation is exactly restored without the 
need for advance knowledge of what this situation was. Is it possible 
for any system to make a sufficiently accurate appraisal of its present 
state to distinguish among the many paths that might have led up to 
that state? 

Even within Newtonian physics, the answer seems to be “no,” if 
the system follows the kind of chaotic behavior described previously 
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in connection with prediction. If the Newtonian equations for a 
system that exhibits such deterministic chaos are reversible, then the 
system will exhibit a related behavior for retrodiction. Therefore, if 
we know, say by observation, that the system is now in some configu- 
ration—for example, described by its being in some region of its 
phase space—and we wish to trace the system backward in time, 
then we will find that the trajectories in phase space diverge rapidly 
into the past as well as into the future. This will imply that a short 
time before we have observed the system, it may have been in one of 
a number of configurations which are widely different from one 
another, and we will have no way of knowing which of these earlier 
configurations was the actual one. One way to say this is that, for a 
chaotic system, there are many roads from many starting points 
which can lead to the same destination; whereas for a nonchaotic 
system, knowing where it ends up specifies fairly precisely where it 
started and the road that it took to get there. It is tempting to 
conclude that irreversibility in general is the consequence of chaotic 
dynamics, but that remains to be shown. 

In situations where the dynamics effectively lead to a loss of infor- 
mation, whether as a result of chaos, irreversibility, or openness, 
one must use other methods to learn about the past of the system. 
In this context, openness can have positive as well as negative impli- 
cations for retrodiction. In our previous example, the expansion of 
the sphere of influence of the original sound can make it easier 
rather than harder to recover it after a time has elapsed. Suppose, 
for example, that the sounds, upon leaving the room in which they 
were uttered, reach the ears of another person while they are still 
loud enough to be audible. In this case, a new trace of them will 
have been made which will be recoverable for a much longer time 
than the original one was. Even if there is no human listener to hear 
the words, one can imagine nonliving objects that might be affected 
by the sound in such a way as to record the words for future detec- 
tion. An example of this was the suggestion a few years ago that 
certain ceramic utensils might have encoded fragments of conversa- 
tions that took place centuries ago while the utensils were being 
made. If the universe were dense with objects that can act as record- 
ing instruments, then as time passed the chances would become 
increasingly great that any event would eventually influence one of 
these instruments, and so be recorded. There is an analogy for pre- 
diction to the problem that an ever wider set of objects must be 
considered for retrodiction. When we try to predict what will hap- 
pen in a specific place, then as we go further into the future an ever 
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greater number of objects in the present may influence this future, 
and so its prediction requires a knowledge of more and more about 
the present. 
VIII. RECORDS 

So far as we know, the influence of most events does not eventually 
happen upon some medium that will make a stable record of the 
event. It may be, however, that we are unaware of some of the ways 
in which such traces can be formed and eventually read. Scientists 
often find new ways of making inferences about the past based on 
novel forms of recording. A recent example is the suggestion that 
there may be evidence of the supernovas that have taken place in 
our galaxy over an extended period of time recorded in the abun- 
dances of certain rare isotopes that have been produced on earth by 
the strong flux of neutrinos that these supernovae emit. 

What are the general characteristics of systems that can produce 
records of events that stimulate them? By a record, we shall mean 
some configuration of a physical system, usually fairly small, from 
which, by using generally accepted ideas such as physical laws, we 
can infer with high probability that a specific event occurred at 
particular time and place in the past. Records are easy to exemplify 
with human artifacts. An example would be a videotape showing the 
Mt. St. Helens volcano erupting at a time identified by a specific 
configuration of planets in the background sky. Records can also 
exist without involving human artifacts, although those are often 
less time specific. An example is the footprint of some animal on a 
mud flat, which has become preserved through the drying out of the 
mud. This record indicates that the animal has passed, but not when. 
But one could, for example, imagine some analysis that would corre- 
late the age of the footprint with some of its current physical char- 
acteristics, which would allow for the dating of the animal’s passage 
as well. Something similar to this can actually be done using radioac- 
tive dating. 

What is it about physical systems that allows them to be used as 
records?!? At least two characteristics are required. 


l. The system should have various states—the “recording states’ — 
that can be produced by different stimuli. In other words, it should 
not be that all stimuli produce the same record. Of course, ideally, 
the system should have many states that are variously produced 


12 The modem scientific consideration of the nature of records was initiated by 
Hans Reichenbach in The Dtrection of Time (Berkeley. Cahfornia UP, 1956) More 
recent discussions can be found in Paul Horwich, Asymmetnes m Trme (Cam 
bridge: MIT, 1988), and references therein. 
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depending on interesting aspects of the stimuli. This requirement 
can be satisfied to varying degrees. Even for humanly designed re- 
corders, such as particle detectors, it is possible for several different 
Particles to trigger the same response. For a naturally occurring 
system to be useful as a detector, one can require only that the 
ambiguity among stimuli is relatively small. 

2. The recording states must remain “stable” for a relatively long time 
so that the record persists and can be read. Stability is required 
against change to another state that would be interpreted as the 
result of a different stimulus. There are a number of ways to assure 
stability. The need for stability seems to require that any physical 
system that can act as a recorder should have a discrete set of record- 
ing states with some strong barrier against spontaneous changes 
between states. The barrier is frequently a requirement of a large 
amount of energy to induce a transition between the states. Other- 
wise, small perturbations from the environment will tend to cause 
the recording system to make transitions between the recording 
states, thus destroying the record. 


Normally, recording devices designed by humans have a metasta- 
ble ready state (an unexposed photographic film, for example), so 
that the interaction with an external system has a high probability of 
changing it into another state. The recording states (developed film, 
for example) are more stable, so that once in such a state the record- 
ing system does not easily emerge from it. The ready state makes it 
possible to record weak stimuli, even when the barrier between re- 
cording states is a high-energy barrier. But it is not necessary to have 
a ready state. For example, we use the state of the surface of the 
moon to estimate the rate of meteor impacts, even though it takes 
no less energy to crater the unmarked surface than it does subse- 
quently to put new craters on the already marked surface. 

Suppose such a recording system interacts with another ‘system 
external to it, and makes a transition into one of its stable states. 
Then some time later, inspection of the recording system will allow 
for the inference that such an interaction has taken place in the past, 
although it will usually not specify when the interaction occurred. 
Of course, some theory is involved in this inference, even in a case as 
simple as determining which animal is responsible for an ancient 
footprint on a beach. Generally, the more complex the recording 
system, that is, the more distinct recording states it has, the better 
the chance of inferring the precise nature of the stimulus that pro- 
duced the record. Alternatively, it may be necessary to use input 
from several recording systems in order to narrow down the possible 
stimuli. It would be very interesting to describe all of the physical 
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systems that can act as recording systems. Such a description, to- 
gether with an idea of the rate at which external systems stimulate 
the recording systems, could lead to an estimate of the total rate at 
which events are being recorded. 

In any event, these are patently in the nature of very preliminary 
remarks. A rigorous, comprehensive, and foundational investigation 
of the nature of records, of the conditions under which records 
become possible, and of the important question of why these condi- 
tions should obtain only in one time direction and not in the other 


has yet (so far as we know) to be undertaken. 
IX. RETRODICTION IN QUANTUM MECHANICS 


The process of retrodiction is importantly different in quantum me- 
chanics than in the macroscopic cases that we have been discussing 
so far. One reason for this is the fundamentally probabilistic effect 
of carrying out a measurement on a quantum mechanical system. The 
crucial point is that laws governing the forward-in-time evolution of 
the quantum state of the world through a measurement event are 
(on the standard formulation) probabilistic laws, and so (in contrast 
with the case of Newtonian mechanics) those laws of forward-in-time 
evolution through a measurement will entail no laws at all (probabil- 
istic or otherwise) for the backward-in-time evolution of the quan- 
tum state of the world through such an event. 

Suppose, for example, that the position of a certain particle is 
measured at a certain time. The outcome of that measurement can- 
not (in the absence of other information) be used to infer the posi- 
tion of that particle, or even its probable position, or even to infer 
probabilities that any of its observable properties had any particular 
values, at times (no matter how close) prior to the measurement. 

In general, according to quantum mechanics, we can only make 
statements about the situation between the times that measurements 
have been made, and even these statements are different than they 
are for predictions. Furthermore, deciding when something has 
happened is much more problematic in quantum mechanics than it 
is in Newtonian mechanics. For an isolated system with a definite 
energy, evolving according to the Schrödinger equation, quantum 
mechanics states that nothing actually happens. The average value 
of any physical quantity is independent of time. This is quite differ- ` 
ent than the corresponding situation in Newtonian mechanics, 
where, for example, the position of a particle can change even 
though its energy is constant. Of course, only certain quantities have 
definite physical values in the quantum mechanical situation, those 
which can be measured simultaneously with the energy. For these 
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quantities, once their value is known at some time, the situation is 
about the same as in Newtonian mechanics. But for other quantities, 
such as the position of an electron in an atom, in a stationary state 
there is no evolution similar to what happens in Newtonian me- 
chanics. 

If we want to discuss retrodiction in a stationary state for quanti- 
ties that cannot simultaneously be measured with the energy, we 
might imagine that we could make use of the language of counter- 
factuals. That is, we can ask what would have been the result had a 
measurement been done at some time in the past, even though such 
a measurement has not been done in the actual world. For example, 
we can ask what would have been the result of a position measure- 
ment of an electron at some time in the past, knowing that the 
electron is definitely in a specific stationary state of an atom and has 
been so since the last measurement was made on it, which we assume 
was prior to the time for which we are asking the counterfactual 
question. For this type of question, the answer given by the formal- 
ism of quantum mechanics is very similar to the answer it gives for 
prediction. That is, quantum mechanics states the probabilities that 
had the measurement been done, it would have resulted in various 
values of the position for the electron. 

This similarity in formalism, however, hides very important con- 
ceptual differences between the predictive and retrodictive ques- 
tions. If the position measurement has not been performed in the 
past, then there is no further possibility of doing it, so that the 
counterfactual retrodiction is entirely sterile, while for the predic- 
tion of future positions the possibility exists of actually carrying out 
the measurement and so testing the prediction. Moreover, if the last 
measurement made on the electron had instead been a position 
measurement, then the electron cannot have been in a state of defi- 
nite energy subsequently. Even if a later measurement of the elec- 
tron’s energy is carried out, then, as stressed earlier, this measure- 
ment will not imply that the electron had the energy implied by the 
measurement beforehand, and therefore the result of the energy 
measurement cannot be used for retrodiction in the simple way that 
it can be used for prediction. 

In some sense, if a quantity has not been measured in the past, 
then according to quantum mechanics it has no value at all. One 
might go so far as to say that according to quantum mechanics, at 
least in its standard version, the past consists of a discontinuous 
series of events when measurements have taken place, with nothing 
happening in the intervals between these measurements. A good 
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analogy to the quantum mechanical picture of the past is a room 
that is filled with objects but is usually darkened. From time to time, 
a light turns on briefly illuminating the objects, but then goes out, 
returning them to the darkness. 

This suggests that, in quantum mechanics, retrodictions should be 
of a different type than we have considered thus far. It appears that 
in order to make valid retrodictions in quantum mechanics, it is 
necessary to make use of actual measurements at two different 
times. Suppose that for a particle, one quantity, such as position, has 
been measured at time —T. If we know the results of that measure- 
ment, we can then infer the value of the time-evolved quantity 
corresponding to position at all subsequent times until another 
measurement has been made. Suppose, however, that we do not 
know the result of the position measurement, but only that it has 
been made. All that we know is that the particle had a definite value 
of position directly after the measurement. We can ask what further 
measurements to be done at t = 0 will allow us to make precise 
statements about the particle at times between —T and 0. The an- 
swer is that, if we measure one specific quantity, which is the posi- 
tion variable evolved from —T to ¢ = 0, then from the result of this 
measurement we can infer the result of the original measurement of 
position at time —T. This is directly analogous to what we can do for 
single-measurement prediction. We can also, from the same results, 
infer the value of a sequence of quantities at intermediate times, 
even if those quantities have not been measured. These quantities 
are what the position operator evolves into at those times between 
—T and 0. In general, they are complicated functions of position 
and momentum, and will not be easy to measure. These quantities, 
however, and their analogues for other pairs of measurements, ap- 
pear to be the only ones for which we can have precise past knowl- 
edge ‘‘counterfactually,” that is, knowledge that does not depend on 
some measurement having been made at the specific past time that 
the knowledge is sought. 

In some sense, quantum mechanics implies that the past does not 
exist, except as a result of whatever measurements have been 
carried out, and which we can do now and in the future. Since we no 
longer have any control over what measurements have been done in 
the past, then the only thing that we can still influence is what meas- 
urements we will do now and in the future. A proper quantum 
mechanical reconstruction of the past would involve the determnina- 
tion of as much as we can on the basis of the sequence of measure- 
ments that have actually been carried out. If we make a suitable 
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present measurement, we can use the double-measurement retro- 
diction which has just described to infer something definite about 
the system between the time of the last measurement and now. For 
the other measurements in the sequence which have been made, it 
will usually not be the case that they are related in the proper way to 
allow us to use single-measurement retrodiction during the times 
between them. 

All of this assumes that there is some way to tell what measure- 
ments have been done in the past on some physical system. In gen- 
eral, it is not possible to do this by examining the physical system 
itself in the present, at least for microscopic systems. While the state 
of such a system will be influenced by such past measurements, 
there is no way to infer uniquely from the state what measurements 
have been done. Furthermore, the state of a system cannot be deter- 
mined just by examining the system itself. In most situations, such 
examination will change the state, and will not allow conclusions 
about the state prior to the measurement. Usually, in order to de- 
termine the state and to decide what measurements have been 
made, we shall need to examine the measuring instruments that 
have been used. That is, we shall have to examine the records of past 
measurements. The notion of a measurement which we are using 
here is a broad one. It involves any interaction between the system 
of interest and some other recording system, usually macroscopic, 
such that by examination of the recording system we can infer some 
definite characteristic of the system of interest. The record of the 
measurement is just the condition of the recording system after the 
interaction has taken place. There seems to be something of an 
infinite regress here, in that in order to determine the condition of 
the recording system, we might need to make a measurement on it, 
etc. For the present discussion, we shall assume that for macro- 
scopic recording systems, this problem does not in fact arise. It is 
not clear whether and why this should be the case. Indeed, one of 
the major unanswered questions concerning quantum mechanics is 
why macroscopic systems enter into definite states, corresponding 
to definite outcomes, rather than remain as mixtures of several 
states, whose interpretation is uncertain. 

Using this notion of measurement, the prospects seem brighter 
for reviving within quantum mechanics something like the standard 
notion of the past. A given physical system will have interacted on 
many past occasions with recording systems. By examining the pres- 
ent traces of these interaction (the records) we can reconstruct some 
features of the past conditions of the physical system. This recon- 
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struction will not give us a continuous past, but at least it will allow 
us to make some definite statements about the system at earlier 
times. To see how this can be done in at least some instances, con- 
sider a situation similar to that first described in a celebrated paper 
written in 1935 by Einstein, Podolsky, and Rosen.!® Those authors 
were trying to bury quantum mechanics rather than to help it, but 
some of their analysis can nevertheless be used for the latter pur- 
pose. They considered two systems that interact leaving some joint 
property, such as total angular momentum, of the combined system 
known because of some conservation law that is satisfied during the 
interaction. Neither system, however, has a definite value for its part 
of the joint quantity, that is, neither system is in a pure state. The 
systems then separate and do not interact any further. At a later 
time, by making a measurement on one of the systems, its value of 
the joint quantity can be determined, and the value of that quantity 
inferred for the other system by subtraction from its known value 
for the total system. If the second system is sufficiently simple, this 
inference is then sufficient to determine its state, without having 
made any direct measurement on it. Einstein, Podolsky, and Rosen 
thought that it was highly suggestive that this state characterized the 
second system at all times after the two systems stopped interacting. 
This conclusion is warranted when the measurement on system 1 is a 
completed measurement, that is, when there is no longer a pure 
state for the joint system. Einstein, Podolsky, and Rosen were con- 
cerned with a somewhat different situation in which the joint system 
remains in a pure state, and there is still an option of what measure- 
ment to carry out on system 1. In that case, the seemingly innocent 
conclusion that system 2 has been in the pure state after the interac- 
tion leads to results that contradict experiment. But that problem 
does not arise in the case that we are considering, for example, if 
system 1 is a macroscopic recording instrument. By the mysterious 
process alluded to earlier through which things happen in quantum 
mechanics, the situation for this instrument has become definite and 
so has that for the system with which it once interacted. Therefore, 
it seems possible by this approach to infer some things about the 
past of physical systems from measurements done in the present. As 
an example, if the measuring device (system 1) is one from which we 
can infer the position of an object that interacts with it, then, if such 
an interaction has taken place in the past, we can infer the past 


13 Albert Einstein, Nathan Rosen, and Bons Podolsky, Physical Review, XLVII 
(1935): 777-80. 
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position of the object by looking at the measuring instrument now. 
Then by use of the evolution equation for the object, we can infer 
the value of the evolved position at times subsequent to that interac- 
tion, and before the next interaction with a recording system. We 
cannot infer anything definite about quantities other than the 
evolved position at these times, however. It would be helpful in our 
thinking about retrodiction to recast quantum mechanics into a 
form that refers to such “happenings,” rather than to the state 
vectors of the usual formalism. The beginnings of such an approach 
have been made by Griffiths and Omnes,’* but it remains to be 
carried through in detail. 

X. WHAT WILL EVER BE KNOWN ABOUT THE PAST AND FUTURE? 
On the basis of these analyses of what can be observed, predicted, 
and retrodicted according to quantum mechanics, we can draw 
some general conclusions about what that theory says will ever be 
known about the world. Imagine that we somehow were in posses- 
sion of a list of all the observations that have ever or will ever be 
done on the world. By observations we mean interactions with re- 
cording instruments. From that list of observations, together with 
the laws of quantum mechanics, we can make inferences about 
things that happened “in-between” the observations. For example, 
from the information obtained from the Nth observation, we can 
infer what the probability would have been for various results of an 
additional measurement, if that additional measurement had been 
made before the N + Ist one was. This probability can become a 
certainty if the observation is of the time evolution of the quantity 
observed last previously. What we can know in this way is obviously 
much less complete than the corresponding result in Newtonian 
mechanics, where we can infer the actual situation at times in be- 
tween measurements. But, in the words of the poet, it is “all we will 
know on earth.” Whether it is all we “need to know” is another 
matter, one of response to limits to knowledge, which is a normative 
rather than a cognitive question. 

GERALD FEINBERG 
SHAUGHAN LAVINE 
DAVID ALBERT 
Columbia University 


14 R Griffiths, Journal of Statsstscal Physics, XXXVI (1984), p. 219; R. Omnes, 
Annals of Physics, CCI (1990), p. 354. 
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COMMENTS AND CRITICISM 


IS IT LESS WRONG TO HARM THE VULNERABLE 
THAN THE SECURE?* 


ichard Brook! has recently offered an account of the dis- 

tinction between “our common (deontological) morality” 

and consequentialism. He presents the deontologist as say- 
ing this: 

The morally significant feature of not being threatened at a moment 

demands some special protection, a protection embodied in the right 

not to be harmed (203). 

This morally significant feature, which is possessed by some moral 
patients but not others, grounds a moral duty. It is more wrong, 
Brook argues, to harm someone who is already threatened than 
someone who is not so threatened. To be threatened is to be such 
that (roughly) “no causal chain has been initiated which, if unin- 
terrupted, will imminently lead to its harm” (202). Brook offers this 
both as something the deontologist would say, and something he 
himself believes. The consequentialist, on the other hand, will ac- 
cept no such distinction, but will prefer to channel harm in whatever 
direction will produce the best consequences. Brook motivates his 
account with an example: 

You are at the zoo with two children who are making a scene. Becom- 

ing angry, you toss them into the lion’s den. Horrified, you come to 

your senses and notice that they can only be saved if you toss a third 
child (who just toddled along) into the back of the den. The beast 
would be distracted and you could quickly leap in and save the first 
two. What should you do? ... it seems clearly wrong to lull the third 
child . as wrong, in fact, as in any of the standard cases where you 

might harm one person to aid others (197-8). 

I do not wish to take a stand on whether Brook is correct in 
thinking that it is wrong to kill the third child, but I agree that it is 
probably wrong if the standard cases of killing one to save others are 
wrong. For present purposes, what matters is that the deontologist 
will say that it is wrong, and I am interested in Brook’s account as a 
principle from which to derive deontological intuitions. His account 
gets the correct answer from the deontological point of view. When 
the parent considers throwing in the third child, he is not threat- 


* I am indebted to Daniel Howard-Snyder for help with this pa 
l In “Agency and Morality,” this JOURNAL, LXXXVIII, 4 (joni 1991): 190- 
212 


0022-362X/92/8912/643—7 © 1992 The Journal of Philosophy, Inc 


644 THE JOURNAL OF PHILOSOPHY 


ened, whereas the first two are. If those who are unthreatened are 
worthy of some special protection not granted to the threatened, 
then it is wrong to throw in the third child, even if doing so is 
necessary to save his siblings. 
I 
If Brook is right, there should be no moral dilemmas to which his 
account gives one answer, while deontologists confidently and unani- 
mously give the opposite answer. In this section, I argue that there 
are a number of such dilemmas, and that a different account (which 
I call the positive/negative account) explains the data better than 
Brook’s. 
Consider the following two cases: 


I. Jack has a kind of cancer which makes it necessary to remove his 
arm. In other words, he is threatened with losing his arm. Now suppose 
(to return to a familiar example) a doctor working in the hospital real- 
izes that Jack’s organs could be used to save the lives of five healthy 
patients. Is it permissible for her to chop Jack up and take his organs? 
Is it more permissible to chop him up than to chop up a completely 
healthy patient, Jill, who has come to the hospital for a yearly 
check-up? 

II. Alice and Bob are going on separate trips into the desert. The 
terrain they will cover is not particularly hostile, except that it is ex- 
tremely hot and there is no water available. If a traveler has two bottles 
of water, there is a very high probability he will survive the trip. If he 
has one bottle, there is a fifty-fifty chance he will survive; and if he has 
no water, he will certainly die. Suppose that unbeknown to Alice, one 
of her bottles has a leak and all the water has run out. Now suppose 
that you are walking through the desert dying of thirst. You find Alice 
lying asleep and consider drinking the water in her bottle. Realizing 
that this is all the water she has, you feel uncomfortable about drinking 
it and walk off to wrestle with your conscience. A little while later you 
come across Bob, and notice that he has two bottles. You are deter- 
mined to steal one of the three bottles, but you debate with yourself 
which action would be least wrong. Would it be worse to drink Alice's 
water or to drink Bob’s? 


No deontologist worthy of the name would say that it is more accept- 
able, in the first case, to chop up Jack than to chop up Jill. Nor 
would he say, with respect to the second case, that it is more accept- 
able for you to drink Alice’s water than to drink Bob’s. That is, in 
these two cases, a deontologist would not agree that it was more 
acceptable to harm the person who is already threatened than the 
person who is not. 

But Brook’s account leads us to believe that this is exactly what 
the deontologist would say. For example, in the second case, it 
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seems as if Brook should say that it is preferable to drink Alice’s 
water, because if you were to drink Bob’s you would be creating a 
threat of death for someone who is otherwise unthreatened; 
whereas if you drink Alice’s water you are simply harming someone 
who is already threatened.” 

Brook has missed the point of his own examples. It seemed, in his 
lion’s den example, that it was less wrong to allow the children I just 
threw into the lion’s den to die than to kill another child to save 
them. In my example, however, it seemed that it was not less wrong 
to kill the man whose arm was about to be amputated than to kill the 
untbreatened bystander. (And this is so in spite of the fact that the 
man whose arm is about to be amputated is already threatened with 
some harm.) 

Killing Jack to get his organs has something in common with al- 
lowing the two children to be devoured by the lion: each is a case of 
choosing that a harm come to someone who is threatened rather 
than someone who is not threatened. If Brook were correct, we 
should say the same about the hospital case as we say about the lion 
case: that it is better for us to choose that the threatened person or 
people suffer. But we do not. Our intuitions about the cases differ. 

Killing Jack to get his organs differs from allowing the two chil- 
dren to be devoured by the lion in that the first is a case of a doing, 
and the second is a mere allowing. Is it not reasonable to suppose 
that that is what explains the difference in our intuitions about 
the cases? 

It will no doubt be objected that there are other differences be- 
tween the cases that might explain the differences in our reactions. 
In Brook’s case, the agent was forced to choose between letting the 
two children be killed by the lion and throwing in the third child. In 
the case I described, the doctor is not forced to choose between 
chopping up Jack and chopping up Jill. There is a third choice. She 
can chop up neither. This difference is itself partly a function of the 
fact that Brook’s example involves a choice between doing and al- 
lowing (there is no third alternative to doing P or not doing P), 


? Being at risk of imminent death does not quite fit Brook's definition of being 
threatened, since it is not clear that there is a causal chain which, if unin- 
terrupted, will lead to one’s death in this case. But I think it would be fair to 

Brook’s point to treat this as a threat. We can see this by ting his 
original example tly. Su it is less than certain that the Lon eat the 
children. Does it follow that are unthreatened, and if unthreatened no lese 
worthy of special protection than the other child? It is extremely implausible to 
say that it is permissible to throw in the youngest child in the case in which it is 
less than certain that the other two will be eaten, whereas it is impermissible in the 
case in which it us certain that they will be eaten 
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whereas mine involves a choice between doing one thing and doing 
another. 

The doctor in my example is forced, however, to choose one of 
the following three options: chopping up Jack, chopping up Jill, and 
failing to save the five who are suffering from organ failure. If she 
ought not to chop up Jack or Jill, then Brook is still in trouble. His 
theory should not have the result that it is worse that harm happen 
to Jack, someone who is threatened, than that a comparable harm 
happen to five other threatened people. 

There is one more difference between chopping up Jack and let- 
ting the children be eaten by the lion. Jack is not threatened with 
death, whereas the children are. Maybe Brook’s point should be that 
those who are unthreatened in a certain respect deserve special pro- 
tection from threats or harms of a comparable kind. 

But suppose that instead of wanting to chop up Jack, the doctor 
wants to amputate his healthy arm to use it in her experiments. The 
threat of losing one’s left arm is comparable to the threat of losing 
one’s right arm. So, if some considerable evil can be averted by 
amputating Jack’s arm, Brook should say that the doctor may do so. 
But the deontologist would not agree, I conjecture. Or consider the 
choice between drinking Alice’s water and drinking Bob’s. Raising 
the probability of someone’s dying of thirst from .5 to 1 is compara- 
ble to raising it from 0 to .5. If Brook is right, it should be slightly 
more acceptable to increase the chance of Alice’s dying than to 
increase the chance that Bob will die. But it does not seem better to 
drink Alice’s water than to drink Bob’s. 

The only difference between Brook’s example and mine which 
explains the difference in our intuitions about them is that his is a 
case of allowing harm and mine are cases of doing harm. It appears, 
therefore, that the principle that it is worse to kill than to let die, or 
to harm than to allow to be harmed (call this the positive /negative 
account), explains the intuitive data considerably better than 
Brook’s account. 

Ul 
If I am right, and the positive/negative account really does get at the 
essence of deontology better than the “‘threatened/nonthreatened”’ 
account that Brook offers, then the positive/negative account 
should get the deontologically correct answer in the lion’s den case. 

Unfortunately, it does not. The positive/negatjve account will say 
that it is worse to kill than to let die. Assuming that it is worse to kill 
twice than to kill once, it will tell the parent to throw his youngest 
child into the lion’s den, because in doing so he will be killing one 
child, but thereby preventing two killings of which he would other- 
wise be the agent. He is faced with a choice not between killing one 
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and letting two die, but between killing one and killing two. Even if 
we deny the plausible assumption that killing one is less wrong than 
killing two, it remains at least permissible to throw the youngest 
child in. Killing one cannot be more wrong than killing two. 

Brook makes this point himself, and offers the advocate of the 
positive /negative account this amendation: “at a given moment one 
is more responsible not to cause harm than to allow harm to occur” 
(194). The important difference here is the emphasis on ‘at a given 
moment’. This new version does get the correct result in the lion’s 
den case. The agent has to choose between now killing his youngest 
child, and it being true that he killed the older two in the past. 
Another way to describe his choice is this: now causing the death of 
his youngest, as opposed to now allowing his other children to die. 
This account says that it would be wrong for him now to cause the 
death of anyone. So, it would be wrong to throw in the little boy. 

Brook prefers his own account: 

It is the moral significance of that [unthreatened] status that accounts 

for the fact that the agent’s duty must be temporally indexed. Other- 

wise, we would wonder why a person especially concerned with her own 
agency would not wish to minimize her own wrongdoing overall i.e., in 

a temporally unrestricted way) (210). 

His point is that either the temporally indexed version of the posi- 
tive /negative account is correct but supervenient on his account, or 
it is incorrect. He believes that that account, offered by itself and 
not as derivative from some more fundamental view, would be ad 
hoc and hence implausible. 

It is hard to know what weight to give such a consideration. It is 
true that as we construct theories we are concerned that our princi- 
ples not be ad hoc or overly complicated. But we are more con- 
cerned that our theories account for important data. The temporally 
indexed version of the positive negative account does that, and does 
so considerably better than Brook’s account. 

The temporally indexed aspect of the positive /negative distinction 
does seem a bit ad hoc. It does not seem very plausible all by itself as 
a moral principle, aside from the fact that it gets the correct answers 
in a number of cases. But then Brook’s own account does not look 
very plausible all by itself. It is not very plausible that being unthreat- 
ened should give someone a right not to be threatened or harmed. 
Would it not be rather odd if morality told us that the weak and 
vulnerable were more deserving of abuse than the strong and 
secure? 

FRANCES HOWARD-SNYDER 
Purdue University 
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BOOK REVIEWS 


Constructibility and Mathematical Existence. CHARLES $. CHIHARA. 
New York: Oxford University Press, 1990. xv, 282 p. Cloth $39.95. 


Charles S. Chihara concurs with most contemporary philosophers of 
mathematics that Platonism about mathematical objects faces seri- 
ous epistemic problems. From here his disagreement with them is 
deep and far reaching—even to the extent developing in “Part I, 
The Constructibility Theory,” ‘‘another kind of mathematics” (23). 
He supports this seemingly radical course in ‘‘Part II, Philosophical 
Developments,” where while criticizing contemporary views he ar- 
gues that his own avoids the problems upon which they founder. 
This scrutiny covers various realisms (W. V. Quine, Kurt Gédel, J. 
Burgess, and Penelope Maddy), structuralism (S. Shapiro and my- 
self), Hartry Field’s nominalism, and Philip Kitcher’s ideal con- 
structor. It also provides a valuable survey of much current philo- 
sophy of mathematics. I shall forgo summarizing this or evaluating 
his criticisms so that I can concentrate upon his positive views. 
Chihara’s “alternative mathematics” replaces set-existence asser- 
tions by claims that open sentences are constructible. For this to 
work, ordinary mathematics must be formulated within the simple 
theory of types. This allows us to map ‘Individual x belongs to a type 
1 class A’ onto a Chihara sentence ‘x satisfies F’, where F is an open 
sentence token with extension A. We map ‘A belongs to the type 2 
class X’ onto ‘Token F satisfies the open sentence token K’, where 
the extension of K is X, and so on. Replacing classes of higher type 
by open sentences raises problems with identities, since classes are 
extensional. Chihara handles this by restricting the domain of open 
sentences to those for which interchange of extensionally equivalent 
arguments holds. (This also deflects problems with quantifying into 
opaque contexts.) He works out the formal details in two chapters 
—essentially transferring simple type theory into the new setting. 
Satisfaction is a primitive here; for obvious reasons, he does not 
define it in terms of classes. He admits that this means that his 
ideology is not fully nominalist. His ontology remains so, however, 
since his constructibility claims concern sentence tokens, not their 
types. Although he bans possible worlds as abstract entities, he uses 
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the heuristic of thinking of constructibility claims as saying that the 
open sentences in question are tokened in some possible world. 
Furthermore, the people doing the tokening in those worlds might 
use a language that we could not grasp or understand. This is sup- 
posed to allow constructibility theory to produce correlates of un- 
countable infinities. 

A chapter shows how the Frege-Russell cardinal numbers can be 
recovered straightforwardly within constructibility theory. Its suc- 
cessor contains a novel reconstruction of the theory of measurable 
quantities, using ordinary quantifiers ranging over one-dimensional 
objects having length. Chihara does not commit himself to these 
“strange entities” (102) but rather offers his theory as an idealiza- 
tion. Aside from its intrinsic interest, Chihara seems to have in- 
cluded this theory in order to show that, contrary to the currently 
influential Quine-Putnam indispensability argument, we need not 
refer to mathematical entities in doing science. Now, I grant that we 
can substitute idealized entities for real valued functions. But I 
wonder whether mathematics will intrude again in applying this 
theory. Henry Kyburg’s Theory and Measurement, which treats 
quantities along lines similar to Chihara’s, gives a detailed analysis of 
the inferences required to apply them in measurement. His account 
still requires a statistical error theory based upon real analysis. 

Of course, an error theory of this sort can be developed within 
Chihara’s system or an extension of it. Thus, he can still reject indis- 
pensability arguments, as he also does, on the ground that in doing 
science we can avoid committing ourselves to abstract mathematical 
objects by interpreting mathematics in terms of constructibility 
theory. 

This raises the question of the relationship between Chihara’s 
system and ordinary mathematics. Chihara tries repeatedly to clarify 
this, but I remain puzzled. In calling his an alternative mathematics 
he refers approvingly to A. Heyting (23), but he is not out to reform 
mathematical practice. To accommodate it he has even relaxed the 
less controversial version of constructibility presented in his pre- 
vious book, Ontology and the Vicious Circle Principle. Furthermore, 
he argues against Field that standard mathematics ‘“‘does contain 
much truth” (173), and states that it is consistent for him to hold 
that it is at least approximately true if properly construed. In illus- 
trating this thesis, he claims that “what is essentially Lebesque’s 
existence theorem” can be proven within constructbility theory, 
thereby showing it to be “correct and sound” (258). This fits poorly 
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with his denial that he is analyzing the meaning of mathematical 
statements. For by itself Chihara’s mapping of standard mathemat- 
ics into constructibility theory shows only that the former is consis- 
tent relative to the latter. The mapping must preserve more for us to 
conclude that it establishes that mathematics is also (approximately) 
true. Chihara does not tell us what this is. Nor do I see how he can 
without facing a number of familiar difficulties confronting reduc- 
tionist programs—ones that he emphasizes do not touch his own. 
These include a version of Paul Benacerraf’s multiple reduction 
problem, since type theory still leaves us many options for defining, 
say, the reals. 

In criticizing Burgess’s realism, Chihara says: “I want to treat the 
question of the existence of mathematical entities as a scientific 
question . .. Where is the evidence that supposedly supports belief 
in these entities? Here, I mean evidence of the sort that would con- 
vince the physicist or biologist” (189). This prompts me to wonder 
about the evidence we might seek for constructibility claims. Take 
the case of the axiom of choice. Its correlate in Chihara’s system 
denies that an open sentence is constructible for which no open 
sentence serving as its “choice function” is constructible. Although 
this is not one of his axioms, there is good reason for including it, 
for the usual axiom is now standard mathematics. The consider- 
ations mathematicians have raised historically for and against it, 
however, are irrelevant to Chihara’s version. He has already dis- 
avowed the constructivist objection that specifying some choice 
functions might require means that we could grasp. Nor can mathe- 
matical fruitfulness count in its favor, since that has nothing to do 
with the sorts of open sentences that might be tokened. Rather, to 
seek evidence bearing upon the Chihara principle we would need to 
turn to biology, anthropology, and linguistics in order to determine 
what sorts of languages possible humans might have. Furthermore, 
we should be wary of idealizations of ourselves; Chihara complains 
that Kitcher’s ideal constructor “appears more godlike than hu- 
man” (243). 

We would find, I suspect, that the open sentences possible hu- 
mans might token are limited by the complexity of their extensions. 
This restriction might make Chihara’s choice principle hold trivially, 
just as restrictions on the domain of intuitionist mathematics yield 
trivial versions of the axiom. But if so, constructibility theory would 
also be limited, and probably could not provide proxies for infinitely 
many complicated sets of integers. In standard mathematics, we 
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sometimes leap over barriers due to complexity by appealing to pure 
existence theorems. Thus we know, for instance, that uncountably 
many sets of integers are not definable in the notation of ZF set 
theory. If the system is consistent, then some of these are extensions 
of “truth predicates” for ZF. But we can easily add such a predicate, 
by adding for each numeral an axiom of the form ‘Tn if and only if 
S’, where n Gödel numbers S under some Gödel numbering we 
define in ZF in advance. One can do something analogous in con- 
structibility theory. But this only postpones the problem. We cannot 
define all the Gödel numberings theoretically definable in ZF; there 
are infinitely many and some are too complicated. Despite this, our 
mathematics tells us that they exist. Can the constructibility theorist 
say something similar? Until we know more about how complexity 
limits constructibility, we just cannot say. 

Perhaps I have misunderstood Chihara’s theory, for I have taken 
constructibility theory as concerned with what it is logically possible 
for possible humans to do. By contrast, it seems to me that in ap- 
proaching mathematics in constructive terms the properties of the 
operations generating possible constructions are of essence, not 
those of the agents using them. This makes constructibility a purely 
mathematical matter rather than biolinguistic one. 

However these and other questions concerning Chihara’s theory 
are resolved, readers will find much of interest in this book. 


MICHAEL D. RESNIK 


University of North Carolina/Chapel Hill 
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